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Abstract 

The theory of exact and of approximate solutions for non-autonomous linear differ- 
ential equations forms a wide field with strong ties to physics and applied problems. 
This paper is meant as a stepping stone for an exploration of this long-established 
theme, through the tinted glasses of a (Hopf and Rota-Baxter) algebraic point of view. 
By reviewing, reformulating and strengthening known results, we give evidence for the 
claim that the use of Hopf algebra allows for a refined analysis of differential equations. 
We revisit the renowned Campbell-Baker-Hausdorff-Dynkin formula by the modern 
approach involving Lie idempotents. Approximate solutions to differential equations 
involve, on the one hand, series of iterated integrals solving the corresponding integral 
equations; on the other hand, exponential solutions. Equating those solutions yields 
identities among products of iterated Riemann integrals. Now, the Riemann integral 
satisfies the integration-by-parts rule with the Leibniz rule for derivations as its part- 
ner; and skewderivations generalize derivations. Thus we seek an algebraic theory of 
integration, with the Rota-Baxter relation replacing the classical rule. The methods to 
deal with noncommutativity are especially highlighted. We find new identities, allow- 
ing for an extensive embedding of Dyson-Chen series of time- or path-ordered products 
(of generalized integration operators); of the corresponding Magnus expansion; and of 
their relations, into the unified algebraic setting of Rota-Baxter maps and their inverse 
skewderivations. This picture clarifies the approximate solutions to generalized integral 
equations corresponding to non-autonomous linear (skew) differential equations. 
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1 Aim, plan of the article and preliminaries 



This paper studies non-autonomous differential equations of the general type 

g(t)g-\t) = m, g(t ) = i G or (1.1) 

g~\t)g{t) = r)(t), g(t Q ) = 1 G , (1.2) 

where the unknown g : W t — > G is a curve on a (maybe infinite-dimensional) local Lie 
group G, with 1q the neutral element; and £(i),7/(t) are given curves on the tangent Lie 
algebra q of G. Before proceeding note that, if g(t) solves (1.1), then <? _1 (t) solves (1.2) for 

n(t) = -m- 

Such equations are pervasive in mathematics, physics and engineering. To begin with, 
G can have a faithful finite-dimensional representation. For instance, consider (affme) linear 
differential equations on R n , 

x = A(t)x + b(t) with x(t ) = x . (1.3) 

They are exactly solved by 



x(t)=g(t,t )x + [ g(t,t')b(t')dt' 

J to 



where the Green's function G(t,to) is the matrix satisfying 

d8(t,t ) 



dt 



g- 1 (t : t ) = A(t), g(t ,t ) = i n 



of the kind (1.1). More generally, dynamical systems admitting a superposition principle can 
be reduced to the form (1.1), with G a finite dimensional Lie group. This assertion is part 
of the classical Lie-Scheffers theory [1], reviewed in Section 3 as part of and motivation for 
the whole enterprise. Even more generally, any non-autonomous dynamical system, given in 
local coordinates by 

^ = Y i (t;x(t)), x = &,..., x") (1.4) 

corresponds to a 'time- dependent vector field' Y with Y(t) belonging to X(M), the Lie alge- 
bra of all vector fields on a manifold M. Then the solution of (1.4) is given by the solution 
of an equation like (1.2); this remark will be formalized in Section 4. The crucial difference 
is dimensionality of the (pseudo-)group. In practice, almost always we must content our- 
selves with approximate solutions — Lie-Scheffers systems are not solvable by quadratures 
in general — and actually those are our main concern. 

We reformulate (1.1) within the framework of Hopf algebra and Rota-Baxter opera- 
tor theory — the latter has become popular recently in relation with the Connes-Kreimer 
paradigm for renormalization theory in perturbative quantum field theory. The convenience 
of such algebraic approach stems already from that, unless G is a matrix group, g and g~ x 
cannot be multiplied, and then equations (1.1) and (1.2) have no meaning, strictu sensu. 
Hopf algebras generalize both Lie groups and Lie algebras, so the problem does not present 
itself in a Hopf algebra formalism. Another advantage is that Hopf algebra and Rota- 
Baxter theory allow for efficient and meaningful comparisons among the different techniques 
for solving (1.1), proposed over the years. The main aim of this paper is to show these 
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and other benefits of our algebraic viewpoint. They have been patent for a while to people 
working on control theory, but mostly ignored by the wider community of mathematicians 
and mathematical physicists. 

We presume the readers acquainted with standard tools of differential analysis, like for 
instance in [2]: primarily the notion of tangent map and the exponential map exp : Q — > G. 
For the benefit of the readers, Lie group and Lie algebra actions are reviewed in Appendix A. 
The basics of Hopf algebra are a prerequisite. Unless otherwise specified, we consider Hopf 
algebras over the complex numbers. We briefly introduce our notations for them, which are 
like in [3]; the pedagogical paper [4] is recommended as well. Given an associative algebra 
with unit H 3 —'■ u(lc), then H ® H is associative with bilinear multiplication given by 
(a <g> b)(c ® d) = ac ® bd on decomposable tensors, and unit 1# <g> 1#. Write just 1 for the 
unit element of H henceforth. One says H is a bialgebra if algebra morphisms rj : H — > C 
(augmentation) and A : H — > H ® H (coproduct) are defined, such that the maps (77® id) A 
and (id ® 77) A from H to H coincide with the identity map id and (A ® id) A and (id <8> A) A 
from H to H ®H ®H also coincide (we omit the sign o for composition of linear maps). One 
says if is a Hopf algebra if it furthermore possesses an antiautomorphism S, the antipode, 
such that m(S <E> id) A = m(S (g) id)A = urj; where m : H Cg> H — > H denotes the algebra 
map. Familiarity with enveloping algebras and the Poincare-Birkhoff-Witt and Cartier- 
Milnor-Moore theorems in particular will be helpful. Both results can be summarized in 
the statement that a connected cocommutative Hopf algebra is the enveloping algebra of 
a Lie algebra, as an algebra, and cofree, as a coalgebra. At any rate, we discuss a strong 
version of the Poincare-Birkhoff-Witt theorem in Section 4, and the Cartier-Milnor-Moore 
theorem at the end of Section 5. The necessary notions of Rota-Baxter operator theory will 
be introduced and explained in due course. 

Whereas the Hopf algebraic description springs up naturally from the intrinsic geomet- 
rical approach, we have found it expedient to smooth this transition with the help of Lie- 
Rinehart algebras: these constitute the "noncommutative geometry" version of Lie algebroid 
technology. 

The plan of the work is as follows. In this section we explain our main aims and fix 
some notations of frequent use. Next we recommend a look at Appendix A, indispensable 
for everything that follows; most readers will just need to scan it for the notations. In 
Section 2 we address for the first time Lie-Scheffers systems; they are intimately linked to 
equations (1.1) and (1.2). 

After this, two paths are possible: either reading Appendices C and D for motivation, 
or not. Most of the stuff in them could be regarded as preceding Sections 3 and following; 
but it gets in the way of our algebraic business, and this is why it has been confined to the 
end. Section 3 plunges the reader at once into an application of Hopf algebra to differential 
geometry. This is due to Rinehart and Huebschmann, and deserves to be better known, as it 
clarifies several questions; one should compare the treatment of differential operators given 
here with that of [5, Chapter 3]. Readers less familiar with Hopf algebra might wish to read 
this in parallel with Section 4. 

Sections 4 to 6 are largely expository. In Section 4 we leisurely construct the Hopf algebra 
structure governing our approach to equations (1.1) and (1.2) from the geometric notions. 
Section 5 recalls some structure results for Hopf algebras. 

After that, our master plan is to transplant the usual paradigmatic strategies for dealing 
with (1.1) and (1.2) to the Hopf algebraic soil, which on the one hand will prove to be their 
native one, and on the other naturally leads to far-reaching generalizations. At the outset, in 
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Section 6 we consider the Campbell-Baker-Hausdorff-Dynkin (CBHD) development, which 
we proceed to derive in Hopf algebraic terms. In turn, that development is the natural father 
of the Magnus expansion method [6]. We eventually derive the Magnus series with the help 
of Rota-Baxter theory. For the purpose, the Riemann integral is treated in this paper as a 
particular Rota-Baxter operator of weight zero. 

We first show in Section 7 that skewderi vat ions and Rota-Baxter operators (of the same 
weight) are natural inverses. The ordinary Spitzer formula is revisited in Section 8, together 
with a nonlinear CBHD recursion due to one of us. The latter is instrumental in obtaining 
the noncommutative Spitzer formula. Also, inspired by the work of Lam, we give a non- 
commutative generalization of the Bohnenblust-Spitzer formula. In the next two sections, 
the Magnus expansion is arrived at as a limiting case of that formula. All along, we try to 
distinguish carefully which statements are valid for general Rota-Baxter operators, which for 
Rota-Baxter operators of vanishing weight, and which just for the Riemann integral. The 
main alternative integration method, the Dyson-Chen 'expansional' [7-9], flows from the 
Magnus series, and vice versa, by our Hopf algebraic means in Section 11. In turn, it reveals 
itself useful to understand the quirks of the Magnus expansion, and to solve the weight-zero 
CBHD recursion. Section 12 explores by means of pre-Lie algebras with Rota-Baxter maps 
the solution of that nonlinear recursion in the general case. Section 13 is the conclusion, 
whereupon perspectives for research are discussed. 

As said, Appendix A reviews the basics of Lie group and Lie algebra actions on manifolds. 
As also hinted at, Appendices B and C run a parallel, complementary strand to the main body 
of the paper. They contain more advanced material on the topic of dynamical systems with 
symmetry; their treatment here naturally calls for the Darboux derivative of Lie algebroid 
theory. The main point is to show how one is led to the arena of Lie algebra and geometrical 
integration, for the solution of differential equations we are concerned with. This provides 
a rationale for our choice of the Magnus series, and its generalizations, as the primary 
approximation method in the body of the paper — see the discussion at the beginning of 
Section 10. 

Appendix D gives the Hopf algebraic vision of a theorem of Lie and Engel. 

Several notational conventions are fixed next. Let M be a (second countable, smooth, 
without boundary) manifold of finite dimension n. The space JF(M) of (real or) complex 
smooth functions on M is endowed with the standard commutative and associative algebra 
structure. Let tm '■ TM — > M denote the tangent bundle to M. Vector fields on M can be 
defined either as sections for tm, that is, maps X : M — > TM such that tm o X = idjv/, or 
as derivations of JF(M). When we wish to distinguish between those roles, we denote by Cx 
the differential operator corresponding to the vector field X. Because the commutator of two 
derivations is again a derivation, the space X(M) of vector fields has a Lie algebra structure; 
we choose to define the bracket there as the opposite of the usual one: in local coordinates, 

[X, Yf = Y j djX i - X j djY*; 

so X(M) = cliff (M), the Lie algebra of the infinite-dimensional Lie group Diff(M) [10]. Also 
X(M) becomes a faithful J-"(M)-module when one defines hX(x) = h(x)X(x) for h £ JF(M). 
Together, JF(M) and X(M) constitute a Lie-Rinehart algebra in the sense of [11]. 

Given a smooth map f : N M, the pull-back /* : T{M) — > J-"(N) is defined as 
f*h = ho f. A vector field X £ X(N) is said to be /-related with the vector field Y £ X(M) 
if Tf oI = 7o/; we then say that the vector field X is /-project able onto the vector field 
Y, and write I ~j 7. We have X ~j Y iff the maps C x ° /* and /* o £ Y from JF(M) 



5 



to F(N) coincide. If Xi,X 2 G X(N) are /-related with Y l5 Y 2 respectively, then X\ + X 2 
and [Xi, X 2 ] are also /-related, respectively with Yi + Y 2 and [Yi, Y 2 ]. A given X G X(iV) will 
not be /-projectable in general. However, if / is a diffeomorphism, then every vector field 
X G X(N) is projectable onto a unique vector field on M, to wit, Y = Tf o X o and we 
say X is the pull-back of Y. A vector field X G X(M) is invariant under a diffeomorphism 
/ of M iff 1 ~j- 1. On M. t (or on an open interval / C M. t ) there is a canonical vector 
field d/dt. A curve 7 : M. t — > M is said to be an integral curve for a vector field X G X(M) 
if d/dt and X are 7-related: 7 := T7 o cZ/cZt = X o 7. Well-known theorems assert that the 
integral curves of a vector field define a local Reaction or flow [12]. 

By a vector field along f we understand a map Y : iV — > TM such that r M o Y = /: 

TM 

TM 

AT — M 

It is clear that the concept is just a particular case of a more general one: section along the 
map / over a general bundle n : E — > M. Vector fields along / can also be regarded as 
/-derivations, in an obvious sense. The right hand side of the non-autonomous dynamical 
system (1.4) is just the vector field along the map tc 2 : M. t x M — > M expressed in local 
coordinates by 

Y = Y i {t;x{t))d i o<K 2 . 

Also, clearly any curve 7 : M, t — > M defines a vector field 7 along 7. We envisage here the 
concept of integral curves of vector fields Y along maps /. These are curves 7 : R$ — > N such 
that the image under T/0T7 of the vector field d/dt coincides with the vector field along /07 
given by Y o 7 — depending on /, there might be vector fields along it without integral 
curves. Under this definition t 1— > (t,j(t)) is always the integral curve of (t,~f(t)) t— > j(t) 
along vr 2 : R t x M -> M. 

2 The Lie— Scheffers theorem 

Definition 1. The system (1.4) of differential equations admits a superposition principle - 
or possesses a set of fundamental solutions — if a superposition function \T/ : M n (' m+1 ) — > M n 
exists, written 

x = *(x(i), . . . , Z( TO j; fcx, . . . , fc n ), (2.1) 
such that the general solution of (1.4) can be expressed (at least for small t) as the functional 

x(t) = *(x { i)(t),...,x (m) (t);/ci,...,/c n ), (2.2) 

where { X( a ) : a = 1, . . . , m } is a set of particular solutions and k±, . . . , k n denote n arbitrary 
parameters. The latter must be essential in the sense that they can be solved from the 
solution functional: 

k = E(x(i)(t),...,X( m )(t))x(t)), with k := (h, . . . , k n ). (2.3) 

The Lie-Scheffers theorem [1] asserts: 
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Theorem 2.1. For (1.4) to admit a superposition principle it is necessary and sufficient 
that the time- dependent vector field Y be of the form 



Y{t] x) = Zx{t)Xx{x) + ■■■ + Z r (t)X r (x), (2.4) 

where, as indicated in the notation, the r scalar functions Z a depend only oft and the r vector 
fields X a depend only on the variables x; and these fields close to a real Lie algebra g. That 
is, the X a are linearly independent and there exist suitable structure constants f^ b such that 

r 

[x a ,x b ] = fg b x c . 

c=l 

Moreover, the dimension r of g is not greater than nm. Systems fulfilling the conditions of 
the theorem are called here Lie-Scheffers systems associated to g. 

Modern reviews of this subject include [13-17]. We ponder a few pertinent examples of 
Lie-Scheffers systems next. 

Linear systems (1.3) are Lie-Scheffers systems. If n+1 particular solutions xm, . . . , Xi n +i) 
of (1.3) are known, such that X(2)(t) — £(i)(t), . . . , £( n+ i)(t) — £(i)(£) are independent, and 
H (t) is the regular matrix with these vectors as columns, then the vector of parameters (2.3) 
is given by 

k = H~ 1 (t)(x(t)-x {1) (t)). 
This follows from the fact that the transformation 

x'{t) = H- l (x(t) -x ( i)(t)) 

reduces the system to dx'/dt = 0. 

A famous example for n — 1 is provided by the Riccati equation: 

x = ao (t) + ai (t)x + a 2 (t)x 2 . (2.5) 

One can understand by Hopf algebraic methods why, up to diffeomorphisms, Riccati's is the 
only nonlinear Lie-Scheffers differential equation on the real line; this was indicated in [18] 
and it is spelled in Appendix D. Also (2.5) is the simplest Lie-Scheffers system not solvable 
by quadratures; and other Lie-Scheffers equations on the line are reductions of it, in an 
appropriate sense — see Appendix B. The superposition principle for the Riccati equation is 
given by 

k = (x - x {2) )(x (l) -X( 3 )) 
(X - X(i))(x ( 2) - X( 3 ))" 

The one-dimensional example 

x = b(t) X (x) (2.6) 
is instructive. We assume x does not change sign in the interval of interest. Let 

r dr' 

4>(x): 

Then 



X(x')' 

x (t) = <p- l (k' + / b{t)dt\ 
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is the general solution. We have therefore a superposition rule of the form 

x(t)=0- 1 (0(x (1) (t)) + fc), 

with m = n = r = 1: only one particular solution is required. Notice that the local 
diffeomorphism projects the vector field corresponding to the right hand side in (2.6) to 
the vector field b(t)d x : in the language of Lie, the flow associated to this problem is locally 
similar to a translation. 

Now we can tackle at last the question of giving intrinsic geometrical meaning to (1.1). 
It turns out to correspond to a Lie-Scheffers system on a Lie group. For an arbitrary curve 
t i— ► g(t) on the r-dimensional Lie group G, we have the vector field along the curve given 
by g(-) as discussed at the end of Section 1. Then we define the left hand side of (1.1) as 

g{t)g-\t):=T m R g -, {t) g{t). (2.7) 

By construction, for each value of the parameter t, this vector lies in T\G = g, the tangent 
Lie algebra of G. We obtain in this way a curve on g. Note that, if g(t) = exp(trj), then 
simply g(t) g _1 (£) = rj. Now, if 7 a is a basis for g, then 

T 

g(t) g-\t) = Ut)l a =■ m e g, (2.8) 

o=l 

for some functions Z a (t). We realize that g(t) is an integral curve of the right invariant 
vector field along R 4 x G — ► G: 

r r 

^ G (t,g):=J2Z a (t)l a G (9) = Y, Z ^ X r^), with 1g (1 g ) = 7 a - 

a=l a=l 

Here 7^ is the fundamental vector field or infinitesimal generator of the left group translations 
generated by 7 a , which is a right invariant vector field — see Appendix A for the notations. 
In the language of (1.4), the differential system is 

r 

g{t) = Y,Ut)l a G (9{t)). (2.9) 

o=l 

The theorem says that for every choice of the functions Z a (t) we have a (right invariant) 
Lie-Scheffers system on the Lie group G, and any such system is of this form. The reader 
should be aware, nevertheless, that for a system of the type (2.9) there might be more than 
one superposition rule [17]. The reason one needs only one particular solution is precisely 
the right invariance of the last equation: if g(t) is the solution such that g(to) = 1, then 
consider g{t) := g(t) go for each g G G. We have 

tit) = T g(t) R go (g(t)) = T g{t) R, 

= TiR g{t)90 [Y.Z^A =J2z a (t) lG (g(t))- 

\0=1 / 0=1 

Equation X R (g) = TiR 9 X r (1g) has been used. Therefore g(t) is the solution of the same 
equation (2.9) with g(to) = go'- the solution curves of the system are obtained from just 



T\R g {t) l^2Z a (t)j c 
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one of them by right-translations. In other words, the superposition functional (2.2) can be 
symbolically expressed by ^(^(i), k) = g(X)k, with k G G; for which always m = 1. 

Lie-Scheffers systems live on manifolds which are not groups in general; however, they are 
always associated with the action of a finite-dimensional Lie group on the manifold on which 
Y(t; x) is defined; and this symmetry of the differential equation can be powerfully exploited 
through the action of the group of curves on the group manifold on a set of systems of the 
same type. This variant of Lie's reduction method is of wide applicability; it is explained in 
Appendix B. 

Let us finally note than in control theory, say on M = lR n , business is often with equations 
of a form not unrelated to (2.9): 

r 

x(t) = + M<t)); 

a=2 

the functions Z2, . . . , Z r being the controls. In the most important cases the X±, . . . , X r vec- 
tor fields close to a finite-dimensional Lie algebra, or X2, ■ ■ ■ , X r close to a finite-dimensional 
Lie algebra. 

3 Differential operators on Lie— Rinehart algebras 

Let R be a commutative, unital ring and A a commutative algebra over R be given. A 
derivation 5 of A is a i?-linear map from A to itself, such that 8{ab) = 5ab + a5b. Since A is 
commutative, the linear space Der(^4) of such maps becomes an ^4-module when we define 
(aS)b = a 5b. Moreover, with the usual bracket given by the commutator Der(A) is a Lie 
algebra. In this paper R = C nearly always. 

A left (right) action of a Lie algebra g on A is a Lie algebra homomorphism (antihomo- 
morphism) a : g — > Der(^4). 

Definition 2. Assume that we are given a commutative algebra A and a Lie algebra g 
which is also a faithful ^4-module. The pair (A, g) is a Lie-Rinehart algebra if there exists 
an ^4-module morphism a : g — > Der(^4), called the anchor, satisfying the compatibility 
condition 

a[X,Y] = [X,aY]-a(X)aY, (3.1) 

for a G A, X, Y G g. If we write m a for the multiplication operator m a (X) = aX and adx, 
as usual, for the adjoint operator adx(^) = [X, Y], the compatibility condition is rewritten 

as 

[ad x ,m a ] = m a(X )a- 

Often, in the definition of Lie-Rinehart algebra, the apparently stronger condition that the 
anchor be also a left action of g on A is required. But, as it turns out, these two definitions 
are equivalent. 

The concept essentially coincides with Kastler and Stora's Lie-Cartan pairs [19]. As 
already indicated (jF(M), X(M), id) is a Lie-Rinehart algebra. A more general example of 
Lie-Rinehart algebra may be (j r (M),r(M,E),a), where F(M,E) is the J r (M)-module of 
sections of a vector bundle E over M, on which a Lie bracket and an anchor a (hence a 
vector bundle map E — > TM, denoted in the same way) are supposed given. If the fibres 
have dimension bigger than one, then a linear map satisfying (3.1) is not only automatically a 
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Lie algebra morphism, but also a JF(M )-module morphism. These geometrical examples are 
called Lie algebroids. A Lie algebroid is called transitive when it is onto fibrewise; totally 
intransitive when a = 0. For examples of this, consider a principal bundle P(M, G, ir) 
over M; if VP is the vertical bundle over P, we have the exact sequences of vector bundles 

-> VP ^ TP -> TM ^ and -> VP/G TP/G -> TM -> 0; 

the second being essentially the Atiyah sequence; and then (C°°(M),r(M,TP/G),Tir/G) 
is a transitive Lie algebroid; while (C°°(M),T(M,VP/G)') is totally intransitive. 

Whenever we have a Lie-Rinehart algebra, we can algebraically define a differential 
calculus. For instance a n-form is a skewsymmetric n-linear map from g to A. If we define d 
on 1-forms by 

d(3(X,Y) = a(X)(3(Y) - a(Y)(3(X) - /3([X,Y]), 

then certainly d can be extended so d 2 = 0. Also, let V be an ^4- module. A ^-connection 
in the sense of [19, 20] is a linear assignment to each element X G g of a linear map p(A) : 
V — > V such that, for v EV , 

(ap(X))v = a(p(X))v] p(X)(av) = a p(X)v + a(X)av. 

If V is moreover a g-module, the connection is flat (as the curvature defined in the obvious 
way vanishes). 

A morphism (A, g) —> (A 1 , g') of Lie-Rinehart algebras is a pair (0, ip) of an algebra 
morphism : A — > A' and an ^4-module morphism ip : g —> g', where the action of A on g' 
is given by aX' := <p(a)X', intertwining the anchors: 

<j)(a(X)a) = a'(ip(X)) 0(a). 

An important example by Grabowski is as follows [21]. A linear operator D : g — >• g is 
called a quasi- derivation for A, and we write D G Qder^jj), if for each a G A there exists 
D(a) £ A — necessarily unique — such that [D,m a ] = fnQr a y where the bracket is the usual 

commutator. It is easily seen that D G Der(^4). Then (id^, ad) : (A,g) — > (A, Qder^(g)), 
where ad : X i— > adx, is a morphism of Lie-Rinehart algebras. 

Our first example of Hopf algebra comes now across: the (universal) enveloping alge- 
bra U(g) of the Lie algebra g. The enveloping algebra is Hopf because there is the diagonal 
algebra homomorphism 

A:U(g) ^U(g®g)~U(g)(g)U(g) by X ^ X ® X ^ X ® 1 + 1 ® X, Al = 1 ® 1, 

for every X G g. 

When g = X(M), the enveloping algebra should not be confused with the algebra (with 
the usual composition product) of differential operators D(M). It is true that the first-order 
elements of both are the vector fields, thus coincide. However, we are going to show that if 
first order differential operators are to be considered primitive elements, then D(M) cannot 
be given a natural Hopf algebra structure; we thank P. Aschieri for making us aware of 
the following argument, since published [22]. Consider the linear map £ : U(X(M)) — > 
D(M) obtained by extending the Lie derivative. This map is not onto because zeroth order 
differential operators are functions, whereas the zeroth order elements of U(X(M)) are just 
scalars. Actually, D(M) is a jF(M)-module, while U(X(M)) is not. For this very reason D(M) 
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cannot be a Hopf algebra. Consider two commuting linearly independent vector fields X, Y 
nonvanishing on a common domain (e.g. locally let X be the partial derivative <9j and Y a 
different one dj), and the vector fields aX, aY, where a is an arbitrary function, nonvanishing 
on the same domain. The composition aCxC-Y = aCyCx is an element in D(M). Suppose 
arguendo that there exist on D(M) a coproduct 5 compatible with composition of operators, 
and such that vector fields are primitives. We would have 

5((i£x£y) = cl£x£y <8> 1 + a£x <8> £y + £y ® a£x + 1 <8> aCx£-y and 
5{aCyC-x) = aCyCx ® 1 + aCy ® £x + Cx ® aCy + 1 ® aCyCx- 

Now, the right hand sides are not equal. As a corollary we have that the map £ is not 
injective: the fact that in U(X(M)) one has a good coproduct implies that aX ■ Y is there 
different ^from aY ■ X, with ■ the product in the enveloping algebra. Thus the kernel of L 
contains aX ■ Y — aY ■ X. Notice that the argument fails if M is one-dimensional. Notice 
as well that 5 makes D(M) into a good coalgebra over F(M). However, the product is then 
not JF(M)-Iinear. 

In spite of the above, Hopf algebra renders us a first great service in helping to manufac- 
ture D(M) out of li(3Z(M)). This involves a purely algebraic construction [11,23] suggested 
by the previous discussion and better presented in the context of Lie-Rinehart algebras. 

Assume for simplicity that A is unital. The universal object of (A, g) is by definition a 
triple (U(A,g),iA,i>g), where U(A, g) is an associative algebra, therefore a Lie algebra with 
the usual commutator, together with morphisms %a '■ A — > U(A,Q) and i 2 : g — ► U(A,g), 
respectively of algebras and Lie algebras, such that 

i A {a)i 5 {X) = i g (aX); [i g (X),z A (a)) = i A (a(X)a); 

and (U(A, g),iAi l g) is universal among these triples; that is, for a similar triple (5,0_4,0 fl ), 
there is a unique algebra morphism : U(A, g) — > B such that Qb^a — 'Pa an d $_b^ = 4> s - 
To construct U(A, g) we employ U(g) in the following way. The condition that the anchor 
maps into derivations means precisely that A is a Hopf W(g)-module [3]. One may keep using 
the same notation a for the new action, as for the generators; a(l) = 1. Consider now the 
smash product or crossed product algebra A x U(g). This is the vector space A®U(g) with 
the product defined on simple tensors by 

(a ®u)(b® v) := aa(u^)b ® 11(2)^, 

where we use the standard Sweedler notation Au = um (8>U(2)- There are obvious morphisms 
i' A : A ^ A>iU(g) and i' g : g — > AxU(g). Now, let J be the right ideal generated in AxU(g) 
by the elements ab® X — a® bX. One has 

(c ® X)(ab ® Y - a ® bY) = cab ® XY + ca(X)(ab) ®Y - ca ® XbY - ca(X)a ® bY 

= cab ® XY + ca(X)a b®Y + caa(X)b ® Y 

- ca® bXY - ca® a(X)bY - ca(X)a ® bY 
= cab ® XY -ca® bXY + ca(X)a b®Y - ca(X)a ® bY 

+ caa(X)b®Y - ca® a(X)bY, 

where in the last equality we just reordered terms; hence J is two-sided. By construction 
it is clear that the quotient U(A, g) := A x U(g)/J together with the obvious quotient 
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morphisms ij± and t s possesses the universal property. Note that ij± is injective. A morphism 
(0, ifj) : (A, g) — > (A 1 , g') induces a morphism of algebras W(0, ip) : U(A, g) — > U(A', g'), and 
vice versa; this is an equivalence of categories. 

One obtains by this construction the ordinary algebra of differential operators D(M) = 
U(J r (M),X(M)). It is also clear that U(C,g) = U(g) with trivial action of g. Just like the 
enveloping algebra, the universal algebra U(A, g) is filtered, with an associated graded object 
gr U(A,g), which is a commutative graded ^4-algebra. There is also a Poincare-Birkhoff- 
Witt theorem for U(A, g) when g is projective over A — which is the case in the geometrical 
examples. It claims that if S^[g] is the symmetric *4.-algebra on g, then the natural surjection 

S A [g]^grU(A,g), (3.2) 

is an isomorphism of ^4-algebras, rather like the S[g] ~ gr U(g) effected, say, through the 
'symmetrization' map a : S[g] — > U(g). In that case i s is of course injective. 



4 Coming by Hopf algebra 

We begin here a journey from the geometrical to the Hopf world. Let us start by some well- 
known observations [24]. A smooth manifold M is determined by the linear space F(M), in 
the sense that points of M are in one-to-one correspondence with a particular class of linear 
functionals on JF(M), to wit, multiplicative ones. One writes (x,h) := h(x) to express this 
correspondence. As a consequence M JF'(M), where JF'(M) is the space of compactly 
supported distributions on M. We denote by CM the subspace of T'{M) generated by the 
points of M. It will sometimes be convenient to write Th = (T, h) for the value of the 
distribution T G J-'(M) at h G !F(M); accordingly we abbreviate to xh = h(x). Many other 
geometrical objects can be expressed as functionals in this way; for instance, if v x G T X M, 
then (v x , h) = v x h is the derivative of h in the direction of the tangent vector v x at x. 
Therefore TM <^-> JF'(M), too. If X is a smooth vector field and Xh := X(h), then xX is 
defined naturally by 

(xX)h = x(Xh). 

That is, xX = X(x); and we may omit the parentheses in xXh. An advantage of thinking 
in this way is that operations in principle not meaningful on M make sense in JF'(M). For 
instance, given a curve 7 : M. t — > M with 7(0) = xq, the definition 

7(0)=lim 7(£) - X ° , 

elO e 

which looks unacceptable on M, in JF'(M) just means that for all h G JF(M): 

7(0)/, := (lim 7(£) ~ X0 , h) := lim h M)l^A. 

Let now / : N — >• M be smooth. If S G J-'(N), a corresponding element Sf is defined 
in P(M) by 

S f h := S(f*h). 

Clearly, xj = xf = f(x), and so S — > Sf extends / to a map from !F'(N) to JF'(M). 
Somewhat rashly, one denotes the extension by the same letter; with this notation, if v x is 
a tangent vector at x G N, the tangent vector T x f{y x ) at f(x) G M would become v x f. 



12 



We may freely use the notation e tx for the flow generated by a vector field X: if 7x(i, %o) 
is the integral curve of X going through x at i = and xe tx := jx(t,x), then the identity 
^(tx^j^)) — X(yx(t,%)) acquires the linear look ^(xe tx ) = xe tx X. We have indeed 
linearized the dynamical system associated to X. In more detail: if xe tx h := h( , yx(t,x)), 
then 

i=l i=l 

= Xh(-y x (t,x)) =: xe tx Xh. 

Linearization works as well for non-autonomous dynamical systems. Recall (1.4) under the 
form: 

x = Y(t;x(t)); x(t ) = x . (4.1) 
For t given, the vector Y(t;x(t)) lives in the fibre over x(t). Denote 

L(t, t )h(x) = h(x(t)), for h E F(M); then = m t )Y(t); 

at 

with Y interpreted as the corresponding time-dependent vector field. This, as announced in 
Section 1, is in the guise of (1.2). The Cauchy problem 

x = xL(t,t ); x(t ) = Xq 

has that of (4.1) as unique solution [25]. The difference with the finite-dimensional case is 
of course substantial; at the analytical level this is discussed at the end of Section 11. 

Linearization is precisely what Hopf algebra is about. Things become really interesting 
when there is a symmetry group G of the manifold M. As pointed out in [26] , linearization is 
then a particularly good idea; for instance, often the action $ is indecomposable (think of the 
case M = G and lateral action) and so contains little information; whereas the linear actions 
of G on JF(M) and JF'(M) are generally decomposable (for instance when G = M = S 1 ). 
This is the point of harmonic analysis. For a fully algebraic description of these phenomena, 
we try to regard !F'(G) and T(G), eventually restricting appropriately the functors J 7 ', J 7 , 
as Hopf algebras. 

There is no trouble in recognizing an algebra structure for the whole of JF'(G): this 
is given just by convolution, which is a map F'{G) ® T'(G) T'{G x G) — > J-'{G). 
If Si,S 2 E !F'(G), then Si * S 2 is defined as the image fi(Si,S 2 ), of the extended group 
multiplication fi : G x G — » G. This is an associative operation [27]. We have in particular 
gi * g 2 — gig 2 , for gi,g 2 E G. The unit element in JF'(G) is 1g- An augmentation on T'(G) 
is given by evaluation on the function 1 E T(G): 

T)(S) = SI. 

In particular r)(g) = 1 for all g E G. Clearly T)(Si * S 2 ) = r](Si)r](S 2 ). A candidate antipode 
is the extension of the inversion diffeomorphism % : g i— > g" x \ certainly it is an algebra 
antiautomorphism: 

i(Si*S 2 ) = i(S 2 )*i(S 1 ). 
In the sequel, the integral notation for convolution 

(Si*S 2 )h'= [ dSi(g')dS 2 (g)h'(g'g), 
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will be handy. A locally summable function h defines a distribution by h! t— > j h(g)h'(g) dg, 
with dg a (left) invariant measure on G. For instance 1 £ T'(G) if G is compact. Now S * /i 
is defined by: 

' ' dS(g')h(g)h'(g'g)dg = [ dS(g') h^'" 1 g)h'(g) dg; 



so we identify it with the function 

g i ► | dS^) /i(^-^) = / %). (4.2) 

Similarly, for h * S: 

dS(g)h(g')h'(g'g)dg' = I dS(g) h(g' g~ v )h' (g') S^ 1 ) dg'; 



where 5 is the modular function, so we identify h * S with </ i— >• J dS(g) h{g'g x ) 5(g x ). In 
particular, 

hi*h2(g)= [ h 1 (g')h 2 (g'~ 1 g)dg' = [ h^gg'^g'" 1 ) dg' = f h^gg'^h^g') 5~\g') dg' . 

JG JG J 

To give JF'(G) a coalgebra structure, one might try the following strategy. The diagonal 
homomorphism d : G — > G x G, given by (7 1 — > (g, g), extends to d : JF'(G) — >• JF'(G x G), by 
the repeatedly used procedure. However, T'(G x G) is vastly bigger than T'(G) ®T'(G). So 
we look for convolution subalgebras 0(G) of T'(G) for which O(GxG) ~ 0(G) ® 0(G). For 
a start, CG will do; and naturally the elements of G, when regarded as elements of J- '(G), 
are grouplike in the sense of Hopf algebra theory: for hi,h 2 £ T(G): 

(Ag, hi <g> h 2 ) = (g, n{hi o d ® h 2 o d)) = (g ® g ,h x ® h 2 ). 

The Hopf algebra CG is too small for our purposes. Nonetheless, recall that the tangent 
algebra q of G also sits inside J-'(G). The discussion around (A. 6) in Appendix A allows 
us to regard the elements of g as right invariant differential operators on G. So let us focus 
on the subalgebra 3 R (G) C D(G) of right invariant differential operators on F(G). There is 
great advantage in regarding any D £ 3(G) as extended to distributions by (DS)h = D(Sh). 
Now, 3 R (G) can be realized as the algebra of distributions on G with support at 1g- In 
effect, look first at the fundamental vector fields £g = X R . From (4.2), we see that in general 

Cg(S * h) = £g(S) * h; as 1q * h = h, we conclude ^g(^) = £g(1g) * h. 

For any element D of 3 R (G) analogously D(h) = D(lc)*h. Note that -D(Ig) is a distribution 
concentrated at 1g- Moreover, 

DD'h = D(l G )*D'(l G )*h, 

so the map D 1— »■ -D(Ig) is a homomorphism. It is in fact an isomorphism, as any distribution 
vanishing outside a point is a finite sum of derivatives of a Dirac function; thus conversely 
D can be written as a polynomial in the right invariant vector fields. 

Therefore we have a new subalgebra of J- '(G). Let us just write £ for £g(1g)- Further- 
more, by the Leibniz rule we are able to define the shuffle coproduct: 

A£ = £<g)l + l<g)£. (4.3) 
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This extends to 3 r (G)(1g) as an algebra homomorphism. Naturally (4.3) says that the 
elements of q, when regarded as elements of J-'(G), are primitive in the sense of Hopf algebra 
theory. A little more work shows that in fact ~D R (G) ~ U(X R (G)) = U(g), the algebra of 
right invariant differential operators coincides with the enveloping algebra of the Lie algebra 
of fundamental vector fields for the left action of G on itself. Also we remark here that the 
equivalence of the Lie algebra structures considered on TiG in Appendix A can be seen from 

see [28]. 

Our 0(G) will be the convolution algebra generated by U(g) = V> r (G)(Ig) and CG; this 
is a Hopf crossed product [3], as g * £ * g" 1 = Ad 9 £, and similarly for more general elements 
of U(g); it can be also regarded as a completion of the latter. Note A o i = (i ® i)A as 
well; we invite the reader to check the rest of the expected Hopf algebra properties. By 
the way, extending Ad to S(g) as well, it is found that the symmetrization map mentioned 
after (3.2) intertwines both actions of G. The centre Z(g) of left and right (Casimir) invariant 
differential operators is clearly a commutative algebra, isomorphic to the algebra of the 
G-invariant elements in S(g) — this states a strong form of the Gelfand-Harish-Chandra 
theorem. 

One begins to feel the power of the Hopf algebra approach: equations (1.1) and (1.2) 
make sense in 0(G) without further ado; we are allowed to write for them 

g(t)*9~\t) =£(*), 9{to) = l G or g(t) = f(t) * g(t), g(t ) = 1 G ; (4.4) 

similarly for (1.2): 

g' 1 (t)*g(t)= V (t), g(t ) = l G or g(t) = g(t) * rj(t), g(t ) = 1 G ; 

The rigorous but roundabout arguments at the end of Section 3 are simplified thereby. 
Moreover the possibility of considering interpolated equations, of the form 

g~ a (t) * g(t) * g~ b (t) = n(t), with a + b = 1, 

opens distinctly [29]. This is uninvestigated. 

Before leaving T'(G), let us note that we refrained from pondering distributions with 
point supports other than the elements of G. This would have allowed us in particular to 
consider TG <^-> ^'(G); and then g * v g > = gv g r, v g > * g — v g ig — see (A. 4) in Appendix A. 
However, one should not believe v g *v g i = v g v g i\ that is, 0(TG) does not embed into J-'(G). 

It is well known that the subalgebra H(G) of 'representative functions' in F(G), with 
its ordinary commutative multiplication, is also a Hopf algebra. The space TZ(G) is made 
of those functions whose translates x h- > h(xt), for all t G G, generate a finite-dimensional 
subalgebra of F(G). Then also 7Z(G) is endowed with a coproduct in which 

Ah e 11(G) ® 71(G) is given by Ah(x, y) := ® h$))(x, y) := h(xy); (4.5) 

which is not cocommutative, unless G is abelian. One has: 

V(h) = h(l); Sh(g) = h(g~ l ). 

Both previous constructions of 0(G) and 7Z(G) are mutually dual. Questions of duality are 
delicate in Hopf algebra theory; fortunately we need not deal with them in particular detail. 
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The main point there is the following. Given any Hopf algebra H and an algebra A, one 
can define [3] the algebraic convolution of two C-linear maps f,hE B.om(H, A) as the map 
/ * h G Hom(if , A) given by the composition 

H±H®H^A®A^>A. 

Here denotes the product map from A cg> A to A. Because of coassociativity of A, the 
triple (Hom(if, A),*, uaVh ='■ Va) is an associative algebra with unit. Now, for A = C, 
the Hopf algebraic definition of convolution on 0(G) as a dual of 1Z(G) coincides with the 
analytical one. 

Algebra morphisms respect convolution, in the following way 

£(f *h)=£f* Ih; similarly (/ * h)i = f£* hi, 

if i is a coalgebra morphism. Clearly the antipode S is the inverse of the identity map id 
for the convolution product of endomorphisms of H [3]. If / G Horn (H, A) is an algebra 
morphism, using the convolution product of End(H) one finds that its composition fS with 
the antipode is a convolution inverse for /: 

f*fS = /(id * S) = fu HVH = VA = f(S * id) = fS*f. 

Denote by Hom a i g (if, A) the convolution monoid — with unit element the map r\A — of multi- 
plicative morphisms of H on the algebra A. In general fS does not belong to Hom a i g (if, A); 
but it does when the algebra A is commutative. Moreover, if A is commutative, the convo- 
lution product of two multiplicative maps is again multiplicative, so Hom a i g (if, A) becomes 
a group, that we may call Gh{A). In particular, this happens for the set Cr#(C) of scalar 
characters, and for Gh(H) if H is commutative. Thus we have a (representable by defini- 
tion) functor Gh going from commutative Hopf algebras to groups. We may call Gh an 
'affine group scheme'. If we suppose H to be graded, connected (meaning that the scalars 
are the only elements in degree zero) and of finite type, then G#(C) is a projective limit of 
triangular matrix groups. An important example is studied in Appendix D. 

In general there will be only an embedding — that can be made continuous — of G into 
the group G-e/g) (R) of characters of 11(G); under favourable circumstances (for instance, 
for G compact, thanks to the Peter-Weyl theorem) both groups coincide. Also if A is 
commutative then Hom(H, A) is an A-algebra. Then a Lie algebra Qh(A) can be obtained 
as well by considering the elements L ('infinitesimal characters') of Hom(if, A) satisfying the 
Leibniz rule 

L(cd) = rjA(c)Ld + rjA(d)Lc, 

for all c,d G H. The bracket [L\, L2] '■= L\ * L2 — L2 * L\ of two infinitesimal characters is 
an infinitesimal character, and so we have a functor Qh from commutative algebras to Lie 
algebras. Needless to say, under favourable circumstances 0tj(g)(K) is just q. 

To summarize, the situation is here quite different of that examined in Section 2, whereby 
we showed B(G) * U(3L(G)), whereas B R (G) ~ U(X R (G)). Notice that B R (G) can be 
expressed directly in Hopf theoretic terms, as follows: a derivation of the commutative 
algebra 11(G) belongs to 3 R (G) iff it is of the form L * id, with L an infinitesimal character 
of 1Z(G). Here the convolution of an endomorphism of 1Z(G) and an element of 0(G) is 
clearly well defined; and indeed 

L * id(M 2 ) = D(ht)h 2 + htD(h 2 ), 
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after a short calculation. Right invariance of L*id is clear. Reciprocally r/D is an infinitesimal 
character. All this is in [30]. 

The books [28, 31, 32] and the review article [33] are good references for most of this 
section. 

5 Some structure results for Hopf algebras 

Familiarity with the tensor T(V) and cotensor (or shuffle) T*(V) Hopf algebras is very 
convenient; we survey them here. Consider a countable basis B = { v±, . . . , v p , . . . } of the 
vector space V, and think of it as an alphabet, a word of this alphabet being a finite sequence 
of i>'s. We let T*(V) be the vector space generated by the set of words and 1 (corresponding 
to the empty word). The length of a word w = v^- ■ -v in is denoted by \w\ = n; naturally 
|1| = 0. Introduce a noncocommutative (deconcatenation) coproduct on T*(V) by the 
formulae Al = 1 ® 1 and 

n 

Aw = v h ■ ■ ■ v ip ® v ip+l ■■■v in , 

with the agreement that when all the terms are on the one side of the tensor sign there is 
a 1 on the other side. Notice that 

(A ® id) A(v h ■■■v in )= Y v n'-- v ip ® v ip+1 ■ ■ -v iq ® v iq+1 ■■■v in = (id <8> A)A(v h ■■■v in ), 

0<p<q<n 

understanding that when p = q the middle term of the summand is 1. Hence (T*(V), A, 77) , 
where rj : T*(V) — > M is defined by 77(1) = 1 and rj(w) = if \w\ > 0, is indeed a coalgebra; 
by the way, any commutative Q-algebra at the place of the real numbers would do here. The 
dual vector space consists of all infinite series of the form ^i v ij where / = { i%, . . . , i n } 
and v'j denotes the dual of v 1 :— v ^ • ■ • Vi n . It becomes an algebra with product 

Wjv'k^i) ■= (v'j ® v' K , Auj) . 

Since the right hand side vanishes unless J UK = I as ordered sets, and in that case equals 1, 
this product is simply concatenation: 

In other words, this dual is the algebra IR [[£']] of noncommutative formal power series in the 
variables Vi, which is the (Krull topology) completion of the algebra M[B'] of noncommutative 
polynomials in the same variables — that is the tensor algebra T(V), as tensor product is 
given by concatenation. 

It is clear that T(V) is a free associative algebra on B' [34]. Moreover, if C(B') is the 
free Lie algebra on B', from the universal properties of C(B') and of U(£(B')^ it follows 
that also W (£(£')) 

is a free associative algebra on B'\ therefore T(V) = U(£(B')\ In 
particular, we have a Hopf algebra structure on T(V), which is inherited by its completion. 
Their cocommutative coproduct is given on monomials by the formula 

n 

A « •■-<)= Yl Yl v '«ii) ■ ■ ■ v '°{i P ) ® ■ ■ ■ v '°<M' 

P=0 (TGSn.p 
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Here we deal with (p, n — p)-shuffles, that is, permutations a of [n] = { 1, . . . , n } such that 
cr(l) < • ■ ■ < a{p) and a(p + 1) < • • • < a(n); we write a G S n>p . From this coproduct we 
obtain a product fti on T*(V) by dualization: 

(v'j, vj rh u*) := (Avj, vj ® (5.1) 

Explicitly, the commutative shuffle product rh is given by 

«ii • • • v i P ^ <Vi • • • v i n = Vi °m ' ' ' Vi °in) ■ 
For instance v (Vj rh = v(UjV k + v(u k Vj + v k ViVj and 

rh Vfcty = ViVjV k Vi + ViVkVjVi + ViV k ViVj + V k ViVjVi + V h ViV X Vj + V k ViViVj. 

It is also easy to check the following formula, which can be employed as a recursive definition 
of the shuffle product: 

v h ■ ■ ■ v ip rh v ip+1 ■ ■ ■ v in = (v h ■■■v ip (h v ip+1 ■ ■ ■ v^Jv^ + (v ip+1 ■ ■ ■ v in rh v h ■ ■ ■ v^Jv^. 

The coproduct on T*(V) and rh are compatible since they are respectively obtained by dual- 
ization of the product and coproduct of the Hopf algebra M[[5']]. The resulting commutative, 
connected, graded Hopf algebra Sh(V r ) = T*(V) is called the shuffle Hopf algebra over V. 
The construction does not depend on the choice of the basis B, since all the algebras involved 
only depend on the cardinality of B. The antipode on T(V) is given by 

S(v[.--v' n ) = (-irv' n -.-v[ ] 

by duality the same formula holds on Sh(V). 

Every polynomial P G WL[B'] can be written in the form P = P n where P n is the sum 
of all monomials of P of degree n. It is called a Lie element if Pq = and each P n belongs to 
the free Lie algebra generated by the v[. The following is a classical theorem by Friedrichs. 

Theorem 5.1. A polynomial P is a Lie element if, and only if, it is primitive. 

Proof. The 'only if part follows easily by induction from the obviously true assertion for 
n — 1. To prove the converse we invoke in context the Dynkin operator D, for whose study 
we recommend [29,35,36]. An abstract definition of D is D = S*Y, where Y is the derivation 
given by the grading; equivalently id * D = Y. If P n is primitive, then so are 

nP n = YP n = 7r(id ® D)(l ® P n + P n ® 1) = D(P n ); 

and vice versa. But D(P n ) is a Lie element, as it corresponds to the left-to-right bracketing: 

D(xi 1 . . . Xi n ) = [. . . [[xitjXiz], Xi 3 ], . . . , Xi n ] (the Dynkin-Specht-Wever theorem). 

To prove the last equality, note that it is true for \w\ = 1. Assume that it holds for all words 
of degree less than n, and let w = xx in — x^ ■ ■ ■ x in . Then 

Aw = AxAx in = (x ( i) <S> X{2)){x in <g> 1 + 1 ® x in ) = x w x in <g> x (2 ) + x {1) ® x^)X in . 
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Since Sx in = —x in and r)(x) = 0, 

(S * Y)w = S(x {1) x in )Yx {2 ) + Sx {1) Y(x {2 )X in ) 

= S(xi n )Sx(i)Yx{2) + Sx(x)Yx(2)X in + Sx^x^Yx^ 

= -x in Sx {1 )Yx i2 ) + Sx { i)Yx {2 )X in 

= [Dx, Xi n ] [. . . Xj 2 ], Xj 3 ], . . . , Xi n ], 

upon using the induction hypothesis in the last equality. (The definition D = Y * S would 
work the same, yielding right-to- left bracketing.) □ 

When V is finite dimensional, the previous argument of Friedrichs' theorem goes through 
for formal power series, because the homogeneous components are polynomials, and there is 
only a finite number of words of a given length. In the infinite-dimensional case Lie series 
are defined as those such that their projections to any finite-dimensional subspace V are Lie 
series over V, so the theorem also holds for series in the infinite- dimensional context [34, 
Section 3.1]. 

Given a power series Z, let us denote by (Z, w) the coefficient of the word w in Z. The 
topology in R[[_B']] alluded above is the weakest topology such that for each w the mapping 
Z i — > (Z, w) is continuous, when M is equipped with the discrete topology. In particular, 
the neighbourhoods of are indexed by finite sets of words, and correspond to those series 
whose coefficients vanish on all the words of the given finite set. Thus, given a sequence of 
series (Z n ) such that for each neighbourhood of 0, all but a finite number of Z„'s are in this 
neighbourhood, their sum Z n is defined as the power series Z satisfying 

{Z,w) = ^2(Z n ,w). 

n 

This sum makes sense since only finitely many terms are different ^from zero for each w. 
Notice that Z can be written as Z = ^2 W {Z, w)w, where the sum runs over the set of words. 
(Henceforth we shall no longer be fussy on 'topological' matters.) 

When Z is a series such that (Z, 1) = 0, then the expression Yl n ^nZ 11 has a meaning for 
any choice of the numbers A n . In particular, we may define exponentials and logarithms as 
usual: 

oo „ n °° (_-l\n— 1 

exp(Z) = V— and log(l + Z) = V ^J—Z", 
z — ' n\ L — ' n 

ra=0 n=l 

As expected 

log(exp(Z)) = Z, and exp(log(l + Z)) = l + Z. 

and routine calculations establish that exp is a bijection from the set of primitive elements 
in the completion of T{V) into the set of grouplike elements, and vice versa for log. 
Equation (5.1) entails 

x)w (g> X. 

Since for grouplike elements AZ = J2 W x(Z, w)(Z, x)w ® x, it follows that 

(Z,w(hx) = {Z,w)(Z,x), (5.2) 

for them. This of course means that the grouplike elements of are precisely those Z 

for which the map w i— > (Z, w) is an algebra homomorphism for the shuffle product. This 
characterization is originally due to Ree [37]. 
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We collect next some elements of structure theory of commutative or cocommutative 
Hopf algebras — mostly due to Patras [38, 39] — beginning by a 'double series' argument 
similar to the one in [34] for the shuffle-deconcatenation Hopf algebra. 

Consider, for H = a graded connected commutative Hopf algebra with aug- 

mentation ideal H + and graded dual H', a suitable completion H £g) H' of the tensor pro- 
duct H (g> H' . This is a unital algebra, with product m ® A* and unit 1 ® 1. Now by Leray's 
theorem — an easy dual version of the Cartier-Milnor-Moore theorem — our if is a symmet- 
ric algebra over a supplement V of H\ in H + [3,39]. Let A index a basis for V, let A (the 
monoid freely generated by A) index the words X u , and let Z u denote an element of the dual 
basis in H'; then the product on H <g) H' is given by the double series product: 

The linear embedding End H ^ H ® H' given by 

f^J2f(X u )®Z u , 

is really a convolution algebra embedding 

(End H, *) — > (if <g) if', m <g> A*). 

Indeed, 

^/(X^^jf^^X^^j = ^ f{X u )g{X v ) <g> 

tGA m,iiGA 

= ( D f{X u )g(X v ) (Z u <g> Z„, AX t ) J ® ^ 

tGA m,iiGA 

= ^ f (5.3) 

tGA 

Notice that the identities wq for convolution and id for composition in End H correspond 
respectively to 

urj \— > 1 <g> 1 and id i— > X u ® Z u . 

ugA" 

Denote 

:= 7 ' ' ' Z Uk , X w ) X m ■ ■ ■ X Uk = : log* idX„,. 

k>l 
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Using the same idea as in (5.3), we get 



■ — 1 — 5Z ^ ■ ■ ■ ® ' ' ■ 



k 

= ^ ^2 ~ ^ ' ' ' Z u>ki X w) X ui ' ' ' ® 

luei fc>i m,...,n fe ^i 

= Y,MX W )®Z W . (5.4) 
We moreover consider the endomorphisms 7r n := 7r™/n! so that, by (5.3): 



We may put 7r := W77. Thus, if a G is of order n, ir m (a) = for m > n. Furthermore, 
for n > 0, 



■a* *n *ca*^ (log*(id")) m ^ (log* id)- ^ 

id a = exp (log (id ))a= > ; a — > t ; a— > t 7r m (a). (5.5) 

^— ' m! ^— ' m! ^— ' 

m=l m=l m=l 

In particular id = 2~^ m >o The graded maps id* n are called the Adams operations or 
characteristic endomorphisms of H; they play an important role in the (Hochschild, cyclic) 
cohomology of commutative algebras [40-42] . The 7i n are often called Eulerian idempotents. 
We have for them: 

Proposition 5.2. For any integers n and k, 

id* n id* fc = id* nfc = id* fc id* n . (5.6) 

and 

TTm.TTfc = 5 mk 1T k . (5.7) 

Proof. The first assertion is certainly true for k = 1 and all integers n, and if it is true for 
some k and all integers n, then taking into account that id is an algebra homomorphism, the 
induction hypothesis gives 

id *n id *fc+l = id *n( id *fc + = id *nfc + id *n = id *n(fc+l)_ 

Substituting the final expression of (5.5) in (5.6), with very little work one obtains (5.7). So 
indeed the tt^ form a family of orthogonal projectors. □ 

Thus the space H = if( m ) always has the direct sum decomposition 

# = 0# n :=07r n (#). (5.8) 

n>0 n>0 
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Moreover, from (5.5), 

id* l H n = l n H n , 

so the H n are the common eigenspaces of the operators id*' with eigenvalues l n . Thus, the 
decomposition (5.8) turns H into a graded algebra. Indeed, if a G H r and b G H s , then 

id*'(a&) = id*W*'& = l r+s {ab), 

and therefore m sends H r ® H s into H r+S . We shall denote by the restriction of 7r n to 
H^ m \ the set of elements of degree m, with respect to the original grading. 

If H is cocommutative instead of commutative, the previous arguments go through. One 
then has 

log^^X u ®Z u ^ = ^X w ®th{Z w ). 

u£A we A 

Furthermore, in this case the Eulerian idempotents of H are the transpose of the Eulerian 
idempotents of the graded commutative Hopf algebra H 1 . In particular, for H cocommuta- 
tive, tti(H) = P(H), the Lie algebra of primitive elements in H. This is easily sharpened into 
the following version [39] of the Cartier-Milnor-Moore theorem: the inclusion ni(H) <— >• H 
extends to an isomorphism of U(tii(H)) with H. 

6 The CBHD development and Hopf algebra 

There are three paradigmatic methods (and sundry hybrid forms) to deal with first order 
non-autonomous differential equations: the iteration formula or Dyson-Chen expansional, 
the Magnus expansion and the product integral. For reasons expounded later, at the begin- 
ning of Section 10, in this paper we look first for the Magnus expansion [6]. In the influential 
paper [43] dealing with the latter method (although Magnus' seminal contribution is not 
mentioned) the famous Campbell-Baker-Hausdorff-Dynkin (CBHD) formula in Lie algebra 
theory is shown to be a special case of general formulas for the solution of (1.2). This is 
scarcely surprising, as that solution involves some kind of exponential with non-commuting 
exponents; also the quest for 'continuous analogues' of the CBHD formula was a motivation 
for Chen's work. Conversely, a heuristic argument for obtaining Magnus' expansion from the 
CBHD formula has been known for some time [13,44]; and a routine, if rigorous and Hopf 
flavoured as well, derivation of Magnus' method from CBHD is available in [45] . Hence the 
interest, as a prelude to our own derivation of the Magnus expansion from the CBHD devel- 
opment (that will employ the concept of nonlinear CBHD recursion and Rota-Baxter theory 
techniques) of rendering the proof of the CBHD expansion in Hopf algebraic terms. This 
was recognized as the deeper and more natural approach to the subject some fifteen years 
ago, but remains to date woefully ignored. Standard treatments of the CBHD development 
can be found in good Lie group theory books like [46]. 

In the sequel we follow [47] and [48]. It will be soon clear to the reader, according 
to the previous discussion, that the CBHD formulae are universal; thus we can as well 
return to the case where H is the Hopf tensor algebra T(V) and where V possesses a basis 
B — {Xx, ...,X n }. The CBHD series £ m >i $ m (X u . . . , X n ) is defined by 

Y,*m(X u ...,X n ) =log(e Xl ---e x "), 

m>l 
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where $ m (Xi, . . . , X n ) are homogeneous polynomials of degree m. 

Now, if a is a grouplike element in a Hopf algebra H, and /, h G Hom(if, A), where A is 
a unital algebra, then 

/ * h(a) = f{a)h(a). 

In particular 

\og(e Xl e X2 ■■■e Xn ) = log* id(e Xl e Xa • ■ ■ e Xn ) =: tti (e Xl e Xa • ■ ■ e x "). (6.1) 
Take first n = 2. Then $ m (X, Y) = 7r( m ' ) (e x e y ) . The Cauchy product gives 

j£i yn—i 



e x e Y 



%\ (n — i ! 

m>Q \i=0 v 



hence 

$ m (x,n= y 



$ m (X,F)= £ ^vrJ m) (W). 

j+j'=m 

A similar argument entails the following proposition. 



Proposition 6.1. 

$ m (X 1? . . . , X n ) = £ -j^—i 4"° W^ 2 • ■ ■ (6.2) 

where the sum runs over all vectors (z'i, . . . , i n ), with nonnegative coordinates, such that 
i\ H h i n = m. 

Denote by </? n (Xi, . . . , X n ) the 'multilinear' part of $ re (Xi, . . . , X n ) — that is, the ho- 
mogeneous polynomial of degree n that consist of those monomials of <& n (Xi, . . . ,X n ) that 
include all the Xj's. This amounts to take Xf = in <fr n (Xi, . . . , X n ), for all i. So by (6.1) 

<p n (X l7 ...,X n )=TT^\X 1 ---X n ), 

since in that case 

e *l... e Xn = (l +Xl )---(l+X n ) 

= J2 Xi + J2 X * X 3 + J2 X ^ X o X k + --- + X l ---X n . (6.3) 

i i<j i<j<k 

Now, if X a := (Xi, . . . ,X n ) ■ a := (X a m, . . . , X a ( n \) denotes the standard right action 
of the symmetric group S n on V"®", then the monomials that include all the X^s are of the 
form X a , therefore 



n[ n \x 1 ---X n )= J2c a X a , 

<?£S n 

for some coefficients c CT , that we shall determine in a moment. 



Proposition 6.2. 



where d(o~) is the number of descents of a, that is, the number of 'errors ' in ordering con- 
secutive terms in cr(l), . . . , a(n). 
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Proof. Assume that a has d descents, say in n ,n + rii, n + n\ + • • • + nj_i, set rij = 

n-n rij-i and let Z = £. X t + £ {<j XiX 3 + ■ ■ - + X X ■ - ■ X n . By (6.3), e Xl ■ • ■ e x " = 

Z + y, where 7 is a collection of terms that contains at least one factor of the form Xf, 
therefore they will not contribute to the coefficient of X a , and we neglect them. Now, since 
log(l + Z) — ^2 Z'i we have to compute the contribution c(j) from each power ZK 

Suppose that the monomial X a (\\ ■ ■ ■ X c i no \ is built ^from ji monomials of Z, and in 
general that each monomial X a ^ no+ ... +n ._ 1+1 ^ ■ ■ ■ X er ( riQ+ ... +rH ) is the product of ji monomials 
of Z. Notice that there are ( n *~^) manners to construct each monomial, in such a way, 

because X a ^ no ^ hn ._ 1+1 ) is always in the first monomial, and once the first ji — 1 monomials 

are chosen, the last monomial is fixed since a is increasing in each segment. Thus 



E,6:i)&:!)-(i:i) 



where the sum extends over all vectors (j , . . . , j d ) satisfying j H — -+jd = j- Since * s 

the coefficient of x^ h ~ x in the binomial expansion of (1 + x) nk ~ 1 , and X^f=oC?» — -0 = J — ^ — 1 ? 
c(j) is the coefficient of x^~ d ~ l in 



J(l + x) nk ~ l = (1 + x )£toK-i) = (1 + x ) 



d— 1 

l^i -r x'j - = ys. -v xj"'- ui • ' = \l -|- x 

i=0 

we therefore conclude that 



n — <i — 1 
3-d- I, 

Now, we have j < n since X a has n letters. Also j > d + 1 as X CT is broken in d + 1 parts. 
Therefore 

V (-l^ Ai-d-lN (-1)- /m 

j=d+l 7 i=0 1 1 \ 

where m — n — d — 1. Now, from the binomial identity 

Finally, a simple induction, using integration by parts, gives 

\, > m j, d\m\ 1 /c? + m\" 1 1 fn— 1 

(1 - x) x dx 



(m + d+l)\ m + d + l\ d J n\ d 
Our task is over. But the number of descents will reappear soon enough. □ 

This construction performed here is arguably more elegant and simpler than the standard 
treatments of the CBHD development by purely Lie algebraic methods. We came in by the 
backdoor, using the bigger free associative algebra, knowing that log(e Xl • • • e Xn ) — and each 
of its homogeneous parts — is primitive, i.e., a Lie element; and that we have the Dynkin 
operator to rewrite it in terms of commutators. 

Let us exemplify with the case n = 2. Obviously we have 

^{X, Y)=X + Y- $ 2 (A, Y) = Y\. 
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Now, 



7rf \XiX 2 X 3 ) = ~X( 12 3) - - (X( 132 ) + X( 2 13) + -^(231) + ^(312)) + g^(321)- 



Therefore 

$ 3 (X,Y) = l -{-nf\x 2 Y) + ^\XY 2 )) 

= - \-X 2 Y - -XYX + -YX 2 + -XY 2 - -YXY + -Y 2 X 
2 V6 3 6 6 3 6 

= ^([[X,Y},Y]-[[X,Y},X}). 
Both cubic Lie elements appear in $3. Similarly 

7r( 4 ' ) (XiX 2 X 3 X 4 ) = -X(1234) — — ^X( 12 43) + X( 132 4) + X( 1342 ) + X( 142 3) + X( 2 134) 
+ X(2314) + X(234l) + X(2413) + X( 312 4) + X( 3412 ) + X( 412 3) 



+ Y2 ( ^( 14 32) + -^(2143) + X(2431) + X(3i42) + X(3214) + X(324l) 
+ X/3421) + Xf 4132 1 + X( i2 i3) + Xm231) + X( i3 i 2 ) rXtfr 



(3421) T ^-(4132) T ^-(4213) "I" ^-(4231) i -^(4312) J — ^^-(4321) 

We concentrate on X 2 Y 2 , as it is clear that most terms coming from X 3 Y or XY 3 will 
vanish; and in fact the corresponding contributions in toto come to naught. We obtain 

$ 4 (X, Y) = — {AXYXY + 2XF 2 X + 2YX 2 Y - 3YXYX - 2XF 2 X - 3YX 2 Y) 

192 

= ~[[[x,n*\,n 

The identity of Jacobi has been used, under the form 

[[[X,Y],X],Y] = [[[X,Y],Y],X\. 

It is remarkable that the other quartic Lie elements, [[[X, Y], X], X] and [[[X, Y], Y], Y], do 
not appear in the fourth degree term. 



7 Rota— Baxter maps and the algebraization of integration 

This paper draws inspiration partly from [30], where Connes and Marcolli have introduced 
logarithmic derivatives in the context of Hopf algebras. Our intent and methods are different; 
but it is expedient to dwell here a bit on their considerations. Given H and A commutative 
as in the last part of Section 4, and a derivation 5 on A, for a multiplicative map G Gh{A) 
Connes and Marcolli define two maps in Hom(if, A) by 5(<f)) := 5 o 0, and then 

D 5 (<P) :=0" 1 *5(0) 
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This yields an A-valued infinitesimal character. Indeed, using Sweedler's notation and mul- 
tiplicativity of £ Gjj(A), one has 

D s {<j>)[cd] = <j>~ x * S(</>)[cd\ = m A (0~ 1 ® 5(<P))A(cd) = 0- 1 (c (1)C / (1) )5(0(c (2) rf (2) )) 

= 0- 1 (qi))r 1 (rf(i))(5(0(c( 2 )))0(rf( 2) ) + 0(c (2) W(d (2) ))) 

= 0- 1 (c (1 ))5(0(c (2) ))0- 1 (rf (1) )0(rf (2) ) + 0- 1 (c (1 ))0(c (2 ))0- 1 (rf (1) )5(0(c/ (2 ))) 

= D 5 (0)[c]^(d) + ^(c)L> 5 (0)[d]. 

Therefore D$((j)) belongs to Qh(A). 

The Dynkin operator appearing in Section 5 — one of the fundamental Lie idempotents in 
the theory of free Lie algebras [34,36] — is a close cousin of the logarithmic derivative Dg(g). 
Consider Gh{H), for H connected and graded. The grading operator Y is a derivation of H 

Y{hti) = Y{h)ti + hY{ti) =: \h\hh' + hh'\h'\. 

The map Y extends naturally to a derivation on End(-ff). With f,g £ End(if) and h £ H 
we find 

Y(f*g)(h) := f*g(Y(h)) = \h\(f*g)(h) = \h\f(hU)g(hW) 

= \h®\f(hV>)g(hW) + \h®\f(hV>)g(h®) = Yf * g(h) + / *Yg (h), 

where we used that A(Y(h)) = \h\A(h) = (\h^\ + \h^\) ® /i^. Now, as before, 
convolution of the antipode S with the derivation Y of H defines a Dynkin operator, to be 
interpreted as an H- valued infinitesimal character [29]. 

Suppose we have a smooth map t \— > L(t) from R t to We could say that one of 

the main aims of this paper is to solve for g(t) the initial value scheme 

D d/dt (g(t)) = L(t); g(0) = Va , (7.1) 

at least for (real and) complex points. Now, both the classical notions of derivation and 
integration have interesting generalizations. It would then be a pity to limit ourselves to the 
classical framework; and so we now jump onto a somewhat more adventurous path. 

For integration, one lacks a good algebraic theory similar to the one developed in [49] , say. 
Next we elaborate on a somewhat unconventional presentation of the integration-by-parts 
rule using the algebraic notion of the weight-^ Rota-Baxter relation corresponding to the 
generalization of the Leibniz rule in terms of weight-^ skew derivations. One should strive 
for nothing less ambitious than developing Rota's program, beautifully outlined in [50] in 
the context of Chen's work [8], of establishing an algebraic theory of integration in terms of 
generalizations of the integration-by-parts rule. 

Let us recall first the integration- by-parts rule for the Riemann integral map. Let A : = 
C(R) be the ring of real continuous functions. The indefinite Riemann integral can be seen 
clS db linear map on A 

I-.A-+A, I(f)(x):= P f(t)dt. (7.2) 

Jo 

Then, integration-by-parts for the Riemann integral can be written as follows. Let 

F(x) := I(f)(x) = [ fit) dt, G(x) := I(g)(x) = f git) dt; 

Jo Jo 
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then 

Jo m jt^ {t)) dt = F W G W ~ JJjt^ G{t) dt 
More compactly, this well-known identity is written 

I(f)(x)I(g)(x) = l(l(f)g)(x) + l(fl(g))(x), (7.3) 

dually to the Leibniz rule. 

Now, we introduce so-called skewderivations of weight 9 G R on an algebra A [51]. A 
skewderivation is a linear map 5 : A — > A fulfilling the condition 

5{ab) = aS(b) + 6{a)b - 95{a)5(b). (7.4) 

We call skewdifferential algebra a double (A, <5; 9) consisting of an algebra A and a skewderiva- 
tion S of weight 9. A skewderivation of weight 9 = is just an ordinary derivation. An 
induction argument shows that if A is commutative we have 



„n—i 



a n - l 5(a) 1 . 



i=l 

Also 



i=0 ^ / j=0 \ J / 

Both formulae generalize well-known identities for an ordinary derivation. We mention 
examples. First, on a suitable function algebra A the simple finite difference operation 
5 : A — > A of step A, 

TO - /(I " A |" /W . (T.5) 

satisfies identity (7.4) with = —A. See [52] for an interesting application of the A = 1 
case in the context of multiple zeta values. A closely related, though at first sight different, 
example is provided by the g-difference operator 

(q — l)x 

which satisfies the g-analog of the Leibniz rule, 

s q(f9)(x) = 8 q f(x)g(x) + f(qx)5 q g(x) = 5 q f(x)g(qx) + f(x)5 q g(x). 

This corresponds to relation (7.4) for 9 — (1 — q), modulo the identity 

$q(fg)( x ) = S q f(x)g(x) + f(x)5 q g(x) + x(q - l)5 q f(x)5 q g(x); 

defining now 5 q = x5 q , it is a simple matter to check that 5 q is a skewderivation of weight 1—q. 

We may ask for an integration operator corresponding to the skewderivation in (7.5). On 
a suitable class of functions, we define the summation operator 

Z(f)(z):=Y,Of(x + en). (7.7) 

n>l 
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For 5 being the finite difference map of step 9, 

f(x + 9n-9)- f(x + 6n) 



Z6(f)(x) = ^(/)(z + 0n)=5> 



9 

n>l n>l 



n>l n>0 n>l 

As S is linear we find as well SZ(f) = f. Observe, moreover, that 

(J20f(x + 9n))(^0g(x + 9m)) = ^ 9 2 f(x + 9n)g(x + 9m) 

^n>l ' ^m>l ' n>l,m>l 

= ( Yl + E + E W^ + ^Mx + flm) 

^n>m>l m>n>l m=ra>l/ 

g[x + ^(/c + n)) /(x + 6>n) 

m>l ^fc>l ' n>l ^fc>l ' 

+ Y J 9 2 f(x + £n)s(x + 0n) = Z(Z(f)g)(x) + Z(fZ(g))(x) + 9Z(fg)(x). (7.8) 

n>l 

Related to the g-difference operator (7.6) there is the Jackson integral 
J[f\(x)--= [ X f(y)d q y=(l-q)J2f(xq n )xq n (0 < q < 1). 

^ n>0 

This can be written in a more algebraic way, using the operator P q [f] := ^ re>0 [/], with 
the algebra endomorphism (g-dilatation) £ , g [/](x) := f(qx), for / G A. The map P 9 is 
a Rota-Baxter operator of weight —1 and hence, id + P q =: P g is of weight +1, see [60]. 
Jackson's integral is given in terms of the above operators P q and the multiplication operator 
M[f](x) := xf(x), f G A, by J[f]{x) = (1 - g)P 9 M[/](x). The modified Jackson integral J, 
defined by J[/](a;) = (1 — q)P q [f](x), satisfies the relation 

J[f] J[g] + (l-q)J\fg] = J[f J[g]] + J[J[f] g]. 

For motivational reasons we remark that the map P q is of importance in the construction 
of g-analogs of multiple-zeta-values. The examples motivate the generalization of the dual 
relation between the integration-by-parts rule and the Leibniz rule for the classical calculus. 

Definition 3. A Rota-Baxter map R of weight 9 G 1R on a not necessarily associative 
algebra A, commutative or not, is a linear map R : A — > A fulfilling the condition 

R(a)R(b) = R(R(a)b)+R(aR(b))-9R(ab), a,b G A. (7.9) 

The reader will easily verify that R := 9 id — R is a Rota-Baxter map of the same weight, as 
well. We call a pair (A, R), where A is an algebra and R a Rota-Baxter map of weight 9, a 
Rota-Baxter algebra of weight 9. The indication 'not necessarily associative' is indispensable 
in this paper, as we soon meet Rota-Baxter algebras that are neither Lie nor associative. 
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We state a few simple observations, which will be of use later. The so-called double 
Rota-Baxter product 

x * R y := xR(y) + R(x)y - 9xy, x,y G A, (7.10) 

endows the vector space underlying A with another Rota-Baxter algebra structure, denoted 
by (Ar, R). In fact, R satisfies the Rota-Baxter relation for the new product. One readily 
shows, moreover: 

R{x* R y) = R{x)R(y) and R(x * R y) = -R(x)R(y), x,y G A. (7.11) 

This construction may be continued, giving a hierarchy of Rota-Baxter algebras. 

Proposition 7.1. Let (A,R) be an associative Rota-Baxter algebra of weight 9 G R. The 
Rota-Baxter relation extends to the Lie algebra A with the commutator [x, y] := xy — yx, 

[R(x),R(y)} + dR([x,y]) = R([R(x),y] + [x,R(y)]) 

making (A, [.,.], R) into a Rota-Baxter Lie algebra. 

This is a mere algebra exercise. A more exotic result coming next will prove to be 
important in the context of Magnus' expansion and beyond. 

Proposition 7.2. Let (A, R) be an associative Rota-Baxter algebra of weight 9 G M. The 
binary composition 

a- R b:= [a,R(b)}+9ba (7.12) 

defines a right pre-Lie (or Vinberg) product such that A becomes a Rota-Baxter right pre-Lie 
algebra. 

Proof. Recall that for a pre-Lie algebra (A, •) the (right) pre-Lie property is weaker than 
associativity 

a ■ (b - c) — (a ■ b) ■ c — a ■ (c • b) — (a ■ c) ■ b, Va, b,c G A, 

As the Jacobiator is the total skewsymmetrization of the associator, the pre-Lie relation is 
enough to guarantee that the commutator [a, b] := a ■ b — b ■ a satisfies the Jacobi identity. 
For the sake of brevity we verify only the weight-zero case and leave the rest to the reader. 

a -r (b - R c) - (a - R b)- R c= [a, R([b, R{c)])] - [[a, R(b)], R(c)] = 
= [a,R([b,R(c)])] + [[R(c),a],R(b)] + [a,[R(c), R(b)}] 
= [a, R([c, R(b)])] - [[a, R(c)}, R(b)] =: a - R (c - R b) - (a - R c) - R 6; 
and R(a) - R R(b) = [R(a), R(R(b))} = R([R(a), R(b)}) + R([a, R(R(b))]) 
= R(R(a) - R b) + R(a - R R(b)) . 

Here we used Proposition 7.1 as well as the Jacobi identity. □ 

The Lie algebra bracket corresponding to the double Rota-Baxter product (7.10) is the 
double Rota-Baxter Lie bracket [a, b]R := a*Rb—b*Ra = a-Rb—b-Ra, known since the work of 
Semenov-Tian-Shansky [53]. We should mention that these little calculations become more 
transparent using the link between associative Rota-Baxter algebras and Loday's dendriform 
algebras [54,55]. 
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As a corollary to the last propositions we add the following identity which will also be 
useful later 

R(a - R b) = R([a, R(b)}) + 9ba) = R([b, R(a)}) + [R(a), R(b)} + 8R(ab) 

= R(b- R a) + [R(a),R(b)}, (7.13) 

which is another way of saying that 

R([a, b] R ) = R{a * R b - b* R a) = [R(a), R(b)} = R{a - R b-b- R a). 

The triple (A, S, R; 8) will denote an algebra A endowed with a skewderivation 5 and a 
corresponding Rota-Baxter map R, both of weight 9, such that R5a = a for any a £ A 
such that 5a ^ 0, as well as 5Ra = a for any a £ A, Ra £ 0. We check consistency of the 
conditions (7.9) and (7.4) imposed on R, 5. Respectively 

95R(ab) = R(a)b + aR(b) - 6(R(a)R(b)) = R(a)b + aR(b) - R(a)b - aR{b) + 9ab = 9ab; 
R5{ab) = R{a5{b)) + R(5(a)b) - 9R(5(a)5(b)) = R{a5{b)) + R(5(a)b) 
- R(a5(b)) - R(6(a)b) + ab = ab. 

The moral of the story is that Rota-Baxter maps are generalized integrals, skewderivations 
and Rota-Baxter operators being natural (partial) inverses. As an example we certainly 
have (C(M), d/dt, J; 0), with 5 = the derivative (with only the scalars in its kernel). Another 
example is given by the aforementioned triple (A, 5, Z; —9) of the finite difference map 5 of 
step 9 and the summation Z in (7.7). 

Rota-Baxter algebras have attracted attention in different contexts, such as perturbative 
renormalization in quantum field theory (see references further below) as well as generalized 
shuffle relations in combinatorics [56] . A few words on the history of the Rota-Baxter relation 
are probably in order here. In the 1950's and early 1960's, several interesting results were 
obtained in the fluctuation theory of probability. One of the better known is Spitzer's classical 
identity [57] for sums of independent random variables. In an important 1960 paper [58], the 
American mathematician G. Baxter developed a combinatorial point of view on Spitzer's 
result, and deduced it from the above operator identity (7.9), in the context where the 
algebra A is associative, unital and commutative. Then G.-C. Rota started a careful in depth 
elaboration of Baxter's article in his 1969 papers [59,60], where he solved the crucial "word 
problem", and in [61], where he established several important results. During the 1960's 
and 1970's, further algebraic, combinatorial and analytic aspects of Baxter's identity were 
studied by several people, see [62-64] for more references. Recently, the Rota-Baxter relation 
became popular again as a key element of the Connes-Kreimer [65-67] algebraic approach 
to renormalization. 

At an early stage the mathematician F. V. Atkinson made an important contribution, 
characterizing such algebras by a simple decomposition theorem. 

Theorem 7.3. (Atkinson [68]) Let A be an algebra. A linear operator R : A — > A satisfies 
the Rota-Baxter relation (7.9) if and only if the following two statements are true. First, 
A + := R(A) and A^ := (8 id — R){A) are subalgebras in A. Second, for X,Y,Z £ A, 
R(X)R(Y) = R(Z) implies (8 id — R) (X) (8 id — R) (Y) = -(did - R)(Z). 

This result degenerates in the case 8 = 0, whereby R = —R. A trivial observation is that 
every algebra is a Rota-Baxter algebra (of weight 1); in fact, the identity map and the zero 
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map are a natural Rota-Baxter pair. The case of an idempotent Rota-Baxter map implies 
9=1 and, more importantly, A_ PI A + = {0}, corresponding to a direct decomposition of A 
into the image of R and R. 

Atkinson made another observation, formulating the following theorem, which describes 
a multiplicative decomposition for associative unital Rota-Baxter algebras. 

Theorem 7.4. Let A be an associative complete filtered unital Rota-Baxter algebra with 
Rota-Baxter map R. Assume X and Y in A to solve the equations 

X = l A + R(aX) and Y = l A + R(Ya), (7.14) 

for a G A 1 . Then we have the following factorization 

Y(1 A - 6a)X = 1 A , so that 1 A — 6a = Y~ 1 X~ 1 . (7.15) 

For an idempotent Rota-Baxter map this factorization is unique. 

Proof. First recall that a complete filtered algebra A has a decreasing filtration { A n } of 
sub- algebras 

A = A D A 1 D • • • D A n D . . . 

such that A m A n C A m+n and A = lim A/A n , that is, A is complete with respect to the 
topology determined by the { A n }. Also, note that 

R(a)R(b) = R(R(a)b) + R(aR(b)), 

and similarly exchanging R and R. Then the product YX is given by 

YX = (l A + R(Ya))(l A + R(aX)) = l A + R(aX) + R(Y a) +R(Ya)R{aX) 
= l A + R(Ya(l A + R(aX))] + r((1 a + R(Y a)) ol) 
= l A + R(YaX)+R(YaX) = l A + eXaY. 

Hence we obtain the factorization (7.15). Uniqueness for idempotent Rota-Baxter maps is 
easy to show [69]. □ 

In summary, finite difference as well as g-difference equations play a role in important 
applications; thus it is useful to consider generalizations of the classical apparatus for solving 
differential equations. In the next section, by exploiting and complementing the CBHD 
development of the previous one, we make preparations to extend the work by Magnus 
on exponential solutions for non-autonomous differential equations to general Rota-Baxter 
maps, beyond the Riemann integral. 



8 The Spitzer identities and the CBHD recursion 

In the last section we mentioned Spitzer's classical identity as a motivation for Baxter's work. 
Now we spell out what that is. Spitzer's identity can be seen as a natural generalization 
of the solution of the simple initial value problem (1.1) on the commutative algebra A of 
continuous functions over R, 

&- = a(t)f(t), /(0) = 1, a E A. (8.1) 
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This has, of course, a unique solution f(t) = exp y J * a(u) du^j. Transforming the differential 
equation into an integral equation by application of the Riemann integral / : A — > A to (8.1), 

f(t) = l + I(af)(t), (8.2) 

we arrive naturally at the not-quite-trivial identity 



exp 



/ a(u)du) =exp(/(a)(t)) = l + V/(a/(a-- -7(a) •-•))(*). (8.3) 
Jo ' n=l O , > 



a-times 



Taking into account the weight-zero Rota-Baxter rule (7.3) for I, the last identity follows 
simply from 

(I(a)(t)) n = n! l(al(a ■ ■ ■ 1(a) • ■ ■ )) (t). (8.4) 



n-times 



Let now (A, i?) to be a commutative Rota-Baxter algebra of weight 9 7^ 0. We formulate 
Spitzer's finding in the ring of power series A[[£]], which is a complete filtered algebra with 
the decreasing filtration given by the powers of t, A n := t n A[[t}}, n > 0. Notice that the 
power series algebra A[[t\] with the operator 1Z : A[[t}} — > A[[t}} acting on a series via R 
through the coefficients, 1Z (J2 n >o an ^ n ) := 5^ n >o R(. a n)t n i is Rota-Baxter as well. Then we 
have 

Theorem 8.1. (Spitzer's identity) Let (A, R) be a unital commutative Rota-Baxter algebra 
of weight 6^0. Then for a G A, 



n=0 

n-times 



in the ring of power series A[[t]]. 

Analytic as well as algebraic proofs of this identity can be found in the literature, see 
for instance [61,67]; and anyway it is a corollary of our work further below. Observe that 

—O^ 1 log(l — a6t) — > at. Thus indeed (8.5) generalizes (8.3). 

Moreover, identity (8.4) generalizes to the Bohnenblust-Spitzer formula [61] of weight 9. 
This is as follows. Let (A, R) be a commutative Rota-Baxter algebra of weight 9 and fix 
si, . . . , s n G A, n > 0. Let S n be the set of permutations of {1, ... , n}. Then 

j2 R(s f7{1) R{s a{2) ■ ■■R( Sa{n) ) •••))= e ° n ~ m n d T i - ^ ^(n s ) » ( 8 - 6 ) 

Here T runs through all unordered set partitions of {1, . . . , n}; by \T\ we denote the number 
of blocks in T; by \T\ the size of the particular block T. The Rota-Baxter relation itself 
appears as a particular case for n = 2. The weight 9 = case reduces the sum over T 
to |T| = n: 

n 

J2R(s a{1) R{s a{2 yR( Sa(n) ) ■■■))= Y[R{ S] ). (8.7) 

o-es„ i=i 
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Also, for n > and Si = • • • = s n = x we find in (8.6): 

r[xR(x---R{x)---)\ = — } en ~ ]T] Y[(\ T \ ~ IY-R{x m ). (8.8) 

Relation (8.6) follows from Spitzer's identity (8.5) by expanding the logarithm and the 
exponential on the left hand side, and comparing order by order the infinite set of identities 
in A[[t]]. 

Spitzer's classical identity constitutes therefore an interesting generalization of the initial 
value problem (8.1), respectively the integral equation (8.2), to more general integration-like 
operators R, satisfying the identity (7.9). Again we refer the reader to [61,64] for examples 
of such applications in the context of renormalization in perturbative quantum field theory, 
g-analogs of classical identities, classical integrable systems and multiple zeta values. Also, 
Atkinson's factorization Theorem 7.4 is obvious from Spitzer's identity. The right hand 
side of identity (8.5) is a solution to X = 1a + tR(aX) in A[[t]] corresponding to the 
factorization of the element 1a — Oat. (One ought to be careful here, since Spitzer's identity 
as well as (8.4) are only valid for commutative Rota-Baxter algebras of weight 9, whereas 
Atkinson's factorization result applies to general associative unital Rota-Baxter algebras.) 

Let us adopt an even more general point of view. For functions with image in a noncom- 
mutative algebra, say n x n matrices with entries in M, relation (8.3) is not valid anymore 
as a solution to (8.1); nor is identity (8.4) valid. From our present perspective, however, it 
does seem quite natural to approach the problem of finding a solution to the initial value 
problem, as well as relations (8.3) and (8.4), on a noncommutative function algebra A by 
looking for a generalization of Spitzer's identity to noncommutative unital associative Rota- 
Baxter algebras of weight 9. This latter problem was finally solved in [70,71] — see also [69], 
where the reader may find more detail and earlier references. We will review briefly those 
results, prior to extend our findings by indicating a noncommutative generalization of the 
Bohnenblust-Spitzer formula. 

We then take the first steps towards the noncommutative Spitzer identity. Let A be a com- 
plete filtered associative algebra. Bring in from Section 6 the Campbell-Baker-Hausdorff- 
Dynkin (CBHD) formula for the product of exponentials of two non-commuting objects x, y 

exp(x) exp(y) = exp(x + y + CBHD(x,?/)), where ^ $ m (x, y) =: CBHD(x, y). 

m>2 

Now let P : A — > A be any linear map preserving the filtration and P = 9 id — P, with 9 an 
arbitrary nonzero complex parameter. For a G A 1 , define the nonlinear map 




where X( n ^( a ) is given by the so-called CBHD recursion, 
X(o) 0) : = a, 

X(ii)(«) = «-JcBHD(P( X ^(a)),P( X ^(a))), (8.9) 

and where the limit is taken with respect to the topology given by the filtration. Then the 
map x d ' P '■ A 1 — > A 1 satisfies 

X 9 ' P (a) = a-]- CBRD(P(x e > P (a)),P(x e > P (a))). (8.10) 
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We call x e ' P the CBHD recursion of weight 9, or just the 0-CBHD recursion. In the following 
we do not index the map x e { a ) := X d ' P by the operator P involved in its definition, when it is 
obvious from context. One readily observes that \ e reduces to the identity for commutative 
algebras. 

The following theorem states a general decomposition on the algebra A implied by the 
CBHD recursion. It applies to associative as well as Lie algebras. 

Theorem 8.2. Let Abe a complete filtered associative (or Lie) algebra with a linear, filtration 
preserving map P : A — > A and P := 9 id — P. For any a G A 1 , we have 

exp(fla) = exp(P( X »)) exp(P( X e (a))). (8.11) 

Under the further hypothesis that the map P is idempotent (and 8 = 1), we find that for any 
x G 1a + A 1 there are unique x_ G exp(P(A 1 )) and x + G exp(P(A 1 )) such that X — x_ x+ . 

For proofs we refer the reader to [69]. Using this factorization one simplifies (8.10) 
considerably. 

Lemma 8.3. Let A be a complete filtered algebra and P : A — > A a linear map preserving 
the filtration, with P as above. The map x 9 in (8.10) solves the following recursion 

X 9 {u) :=u+^CBRD(eu,-P(x 6 («))), u G A 1 . (8.12) 

The convolution algebra (Hom(i7, A), *), for H a connected graded commutative Hopf 
algebra, will also be complete filtered. We may immediately apply the above factorization 
theorem, giving rise to a factorization of the group Gh{A) of A-valued characters, upon 
choosing any filtration-preserving linear map on Hom(H, A). In fact, we find for 9 = 1 in 
the definition of x the following result. 

Proposition 8.4. Let A be a commutative algebra and H a connected graded commutative 
Hopf algebra. Let P be any filtration preserving linear map on Hom(H, A). Then we have 
for all (p G Gh(A) and Z := log(0) G gii(A) the characters 4>Z l '■= expfP(%(Z))J and 
:= exp(P(x(Z))) such that 

= 0-!*0+. (8.13) 
If P is idempotent this decomposition is unique. 

A natural question is whether one can find closed expressions for the map x 6 '■ The answer 
is certainly affirmative in some non-trivial particular cases [69] . 

Corollary 8.5. In the setting of the last proposition we find for the particular choice of 
P = 7r_ : H — ► H being the projection to the odd degree elements in H (hence 9 = 1) 

X (Z) = Z + CBHD (z, -ir-(Z) - \ CBHD (Z, Z - 7r_(Z))V Z g g H (A). 

Before proceeding, we must underline that the factorization is due solely to the map x d \ 
in fact, the map P — respectively P — involved in its definition has only to be linear and 
filtration preserving. The role played by this map is drastically altered when we assume it 
moreover to be Rota-Baxter of weight 9, on a complete filtered Rota-Baxter algebra. This 
we do next, to rederive and generalize the Magnus expansion. Also, it will soon become clear 
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what x° is. One of the aims of this paper is to attack the solution of the CBHD recursion 
when P is Rota-Baxter. 

We noted earlier Atkinson's multiplicative decomposition of associative complete filtered 
Rota-Baxter algebras. Let from now on (A, R) denote one such, of weight 8^0. Observe 
the useful identity 

n n n 

6^R{xi) = R(j[R{xi)-Y[R(-xSj, for Xi G A, % = 1, . . . , n. (8.14) 

i=l i=l i=l 

This comes from the Rota-Baxter relation (7.9). The case n = 2 simply returns it. The 
reader should check it with the help of the double Rota-Baxter product (7.10). In the 
following we consider x° := X 9,R on A 1 . Using (8.14), for 0~ 1 log(l J 4 — da) =: u G A 1 one 
readily computes 

^{-R{ X \u))) = l A + Y, R{ ~ Xd{U))n 



n>0 



= 1a + E y-W(«)) n - (-%V)n) 

n>0 

= 1 A + ^R(exp(-R( X 9 (u)) - exp(R( X 6 (u))) =1 A + R(aexp(-R( X e (u)) 

In the last step we employed the factorization Theorem 8.2 corresponding to \ d ■ Therefore, 
on the one hand we have found that X := exp(— R(x e (u))) G 1^+A 1 solves X = 1a+R(ciX), 
one of Atkinson's recursions in Theorem 7.4. On the other hand, a solution to this recursion 
follows from the iteration 

X = 1 A + ^2R(aR(aR(a ■ ■ ■ R(a)) ...)). (8.15) 



n>o v "> — 

n times 

Hence: 



Theorem 8.6. The natural generalization of Spitzer's identity (8.5) to noncommutative 
complete filtered Rota-Baxter algebras of weight 9 ^ is given by 

exp (- R ( X s ( l0g(1 A ~ 6a) )))= ^ RjaRWa • • • R(a) ) ■■■)), (8.16) 



n=0 

n times 



for a G A 1 . Recall that x e reduces to the identity for commutative algebras, yielding Spitzer's 
classical identity. 

So far we have achieved the following. First we derived the general factorization The- 
orem 8.2 for complete filtered algebras, upon the choice of an arbitrary linear filtration 
preserving map. Specifying the latter to be Rota-Baxter of weight 6, that is, identity (8.14), 
we have been able to show that Atkinson's recursion equations in Theorem 7.4 have expo- 
nential solutions. It is now natural to ask whether the Bohnenblust-Spitzer formula (8.6) 
valid for weight-^ commutative Rota-Baxter algebras can be generalized to noncommutative 
ones. The answer is yes! We outline next this generalization, postponing detailed proof to 
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the forthcoming [72], to keep this long work within bounds. First, by using the pre-Lie (7.12) 
and the double (7.10) Rota-Baxter products, we find 

R(xiR(x 2 )) + R(x 2 R(xi)) = R(xi)R(x 2 ) + R(x 2 - R xi) = R{x 1 * R x 2 ) + R(x 2 - R x x ). 

Recall the relations (7.11). One may now check by a tedious calculation that 

E R[x ai R(x a2 R(x a3 ))j = R(x! * R x 2 * R x 3 ) + R((x 2 - R x{) * R x 3 ) 

+R((x 3 - R xx) * R x 2 ) + R(x 3 -r (x 2 ■ r xx)) + R(x! * R (x 3 - R x 2 )) + R(x 2 - R (x 3 - R xi)) 
= R(xi)R(x 2 )R(x 3 ) + R(x 2 ■ r xi)^(x 3 ) + R(x 3 - R xi)R(x 2 ) 

+R(x 3 -r (x 2 - R xt)) + R{xi)R{x 3 - R x 2 ) + R(x 2 - R (x 3 - R xi)). (8.17) 
We obtain special cases of the above when x\ = x 2 = x 3 = x 

3\R(xR(xR(x))^j = R(x *r x * R x) + 2R({x • r x) *rx) + 2R(x - R (x - R x)) 

+ R(x* R (x ■ r x)) = R(x) 3 + 2R(x -r x)R(x) + 2R(x - R (x - R x)) + R(x)R(x - R x). 

Equation (8.17) is an instance of the following result, that seems to be new: 

Theorem 8.7. Let (A, R) be an associative Rota-Baxter algebra of weight 9. For Xi & A, i = 
1, ... ,71, we have 

<T&S n <T(zSn 

where 



X ai <>i X n 2 ■ ■ ■ O n X(j 



X (7i ®i X ai+1 



x<n *r x ai+1 , <Ti < min(o-j \i < j, 
x<n ■RX<r i+1 , otherwise; 



furthermore consecutive ■ r products should be performed sfrom right to left, and always before 
the *r product. 

This is the noncommutative Bohnenblust-Spitzer formula. Obviously, in the commutative 
case, i.e., when a- R b = 6ab, we just recover the classical Bohnenblust-Spitzer identities (8.6) 
and (8.8). On the other hand, anticipating on coming sections, the case 6 = reduces to 
Lam's factorization theorem [73], stated in the context of a weight-zero Rota-Baxter algebra 
of operator valued functions (B,J;0). Let us adopt Lam's notation for the n-fold right 
bracketed pre-Lie product by 

C* := C*{x) := R(x - R (x - R . . . (x - R x) . . . )). (8.18) 

Also, we introduce a notation for the so-called Rota-Baxter words: 

(Rx) [n+1] = R(x(Rx) [n] ), (8.19) 

with the convention that (Rx)^ = 1. Then we obtain the general expression 

n fiR. fiR 

(^) W = E E TTr 1 ? 1 ' "n ~ T~v ( 8 - 20 ) 

fcl-i \-h=n 
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That is to say, we sum over the compositions of n. The simplest cases already examined 
now are written 

2\(Rx) [2] := 2\R(xR(x)) = (Cf f + , 
3!(Px) [3] := 3\R(xR(xR{x))^ = (Cf ) 3 + 2CfCf + Cf + 2C*. (8.21) 

Later we make use of those expansions in relation with the Magnus expansion and the Dyson- 
Chen series. In fact, the left hand side of the above expressions are the second and third 
order terms in the path- or time-ordered expansion, in the context when the map R is the 
Riemann integral. We just generalized this to general-weight Rota-Baxter operators. 



9 The zero-weight recursion 

Let us come back to the CBHD recursion \ e . The question of the limit 6 { becomes subtler 
than in the commutative case, due to the particular properties of relation (8.10). Now, in 
general we may write $(a, b) = 3>i(a, b) + CBHD (a, b) as a sum 

$(a,6) =^i? n (a,&), 

n>l 

where H n (a, b) is the part of <&(et, b) which is homogenous of degree n with respect to a. For 
n = 1 we have [34]: 

ad b , v ad b ( , ad b 

e ad& _ 1 



ffxM) = oadfc - (a) = -^(coth^ - l)(a). (9.r 



In the limit ^10 all higher order terms H n> i vanish and from (8.12) we get a nonlinear 
map x° inductively defined on A 1 by the formula 

x» = ^©-W = (u + E^[ adp ^ 0(a) )]1 (a) - (9 - 2) 

A ^ n>0 ' 

We may call this the weight- zero CBHD recursion. The coefficients are b n := B n /n\ with S n 
the Bernoulli numbers. For n = 1,2,4 we find b\ = —1/2, 62 = 1/12 and 64 = —1/720. We 
have 63 = 65 = • • • = 0. The first terms in (9.2) are then easily written down: 

X °(a) = a - i[P(a), a] + i [P([P(a), a]), a] + ^[P(a), [P(a),a]] (9.3) 

-ip([P([P(fl),[P(fl), a ]]), ffl ] + [P(o),[P([P(o),a]),a]] + [[P([P(^),a]),[P(a),a]] 

-lp([P([P([P(a),a}),a}),a] 

We pause here to note that (9.1) is but an avatar of the formula 

j g— ad a 

D(e a ) = e a Da, for D a derivation; 

ad a 

a noncommutative chain rule familiar from linear group theory. See [74, Chapter 1] for 
instance. Apparently this is due to F. Schur (1891), and was taken up later by Poincare and 
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Hausdorff. One may also consult the charming account of the determination of a local Lie 
group from its Lie algebra, using canonical coordinates of the first kind, in [75, Chapter 13]. 
The appearance of the Bernoulli numbers is always fascinating. The deep reason for it is 
that we are trying to express elements of the enveloping algebra in terms of the symmetric 
algebra. 

Now suppose P is a weight-zero Rota-Baxter operator, denoted R henceforth. The non- 
commutative generalization of Spitzer's identity in the case of vanishing weight is captured 
in the following corollary. 

Corollary 9.1. Let {A,R) be a complete filtered Rota-Baxter algebra of weight zero. For a 
in A 1 the weight-zero CBHD recursion x° '■ A 1 — * A 1 is given by equation (9.2); 



X°(a) 



&dR(x°(a)) 

3 adR(x») _ l 



Moreover: 

1. The equation x 

2. The equation y 



1 + R(ax) has a unique solution x = exp(i?(x°(a))) . 
1 — R(ya) has a unique solution y = exp(i?(— x°(a))) 



We will see pretty soon that the 0-CBHD recursion gives Magnus' expansion. The dia- 
gram (9.6) further below summarizes the foregoing relations, generalizing the simple initial 
value problem (8.1) in a twofold manner. First we go to the integral equation (8.2). Then we 
replace the Riemann integral by a general Rota-Baxter map and assume a noncommutative 
setting. That is, we start with a complete filtered noncommutative associative Rota-Baxter 
algebra (A, R) of nonzero weight 9 in the appropriate field. The top of the diagram (9.6) 
contains the solution to the equation 

a e A 1 , 



X = l A + R{a X), for 
generalized to associative, otherwise arbitrary Rota-Baxter algebras (8.16) 

'log(l^ - 6a) 



X = exp -R[x 







(9.4) 



(9.5) 



The 0-CBHD recursion \ B is given in (8.10). The left wing of (9.6) describes the case when 
first the weight 9 goes to zero, hence reducing \ 9 ~* X°'i see (9-2). This is the algebraic struc- 
ture underlying Magnus's Q series of the next section. Then, we let the algebra A become 
commutative, which reduces x° to id. The right wing of diagram (9.6) just describes the 
alternative reduction, i.e., we first make the algebra commutative, which gives the classical 
Spitzer identity for nonzero weight commutative Rota-Baxter algebras, see (8.5). Then we 
take the limit 8 10. 



exp(- J R(x e (r 1 log(l-0a)))) 

8y^0, noncom 



(9.6) 



exp(R( X °(a))) 

Magnus 





com 
6»|0 



exp(- J R(0- 1 log(l -6a))) 

Spitzer 



exp(_R(a)) 

9=0, com 
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Both paths eventually arrive at the elementary fact that equation (9.4) is solved by a simple 
exponential in a commutative, weight-zero Rota-Baxter setting. We have succeeded in 
finding the general algebraic structure underlying the initial value problem for generalized 
integrals, that is, Rota-Baxter operators. 



10 On Magnus' commutator series 

It is high time for us to declare why we choose to deal with first order non-autonomous dif- 
ferential equations primarily via the Magnus expansion method. The latter has a somewhat 
chequered history. To attack the initial value problem of the type (1.1): 

j t F(t) = a(t)F(t), F(0) = 1, (10.1) 
with F a matrix-valued function, say, Magnus proposed the exponential Ansatz 

F(i) = exp(ft[a](*)), 
with f2[a](0) = 0. He found a series for Q[a}: 

n[a](t)=Y,ton[a]{t), (10.2) 

n>0 

in terms of multiple integrals of nested commutators, and provided a differential equation 
which in turn can be easily solved recursively for the terms fl n [a] (t) 



d _ w . adfifa 



It is worth indicating that originally Magnus was motivated by Friedrich's theorem of our 
Section 5. We already mentioned, however, that one of the papers most influential on 
the subject [43] was written without knowledge of Magnus' paper. In the 1990's, several 
mathematicians interested in approximate integrators for differential equations developed 
the discipline of geometrical integration. Originally also unaware of Magnus' work, they 
derived anew Magnus' expansion. The point was to make sure that the approximate solutions 
evolve in the Lie group if in (4.4) remains in the Lie algebra. This is not true of the 
iterative Dyson-Chen method — no finite truncation of the latter is the exact solution of 
any approximating system. By construction x° respects the Lie algebra structure in (4.4), 
and thus truncations of the series are sure to remain in the Lie group. This is one reason 
why — in view also of the considerations in Appendix C — we give priority to the Magnus 
method. For geometrical integration, consult [76,77]. 

Comparison with (9.2) settles the matter of the link between Magnus series and the 
CBHD recursion in the context of vanishing Rota-Baxter weight. Namely, 

Corollary 10.1. Let A be a function algebra over M. t with values in an operator algebra. 
Let R denote the indefinite Riemann integral operator. Magnus ' Q expansion is given by the 
formula 

n[a](t) = R( X °(a))(t). 
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In conclusion, we could say that the 0-CBHD recursion (8.10) generalizes Magnus' ex- 
pansion to general weight 0^0 Rota-Baxter operators R by replacing the weight-zero 
Riemann integral in F = 1 + R{aF}. Corollary 9.1 represents a more modest generalization, 
to zero-weight Rota-Baxter operators different ^from the ordinary integral. 

Let us write explicitly the first few terms of the Magnus expansion using (9.3), when R 
is the Riemann integral operator. The function a = a(t) is defined over M. and takes values 
in a noncommutative algebra, say of matrices of size n x n. We obtain 

R(a){t) = [ a(ti)dti, (10.4) 
Jo 

- ^R([R(a), a])(t) = 1 jT ^[a^^ih^dhdh, 

-R^[R([R(a), a}), a\yt) = -j J J [[a(t 3 ), a(t 2 )\, a^dtadhdh, 
— R(jR(a),[R(a),a]]yt) = — / / / [a(t 3 ), [a(t 2 ), a(ti)]] cft 3 cZt 2 



This gives indeed the first terms of the expansion precisely in the form that Magnus derived 
it. However, in later works [43-45,78,79] the terms in the Magnus' expansion are presented as 
iterated commutator brackets of strictly time-ordered Riemann integrals. Especially in [43] 
Strichartz succeeded in giving a closed solution to Magnus' expansion — and hence to our 
recursion x°' R when R is the Riemann integral. With the notation of Proposition 6.2, he 
found 

Q[a](t) = ^2n n [a]{t), with (10.5) 

n>0 

Vn[a](t) = V \ / / .../ l[...[a{t 1 ),a{t 2 )]...\,a{t n )]dt n ...dt 2 dt 1 . 
CTe5 „ n W)) J o Jo Jo 

This formula clearly points to the close relation between the CBHD expansion and Magnus' 
series, although the appearance of the number of descents is still 'unexplained'. However, it 
is not to everyone's taste. It is immediate from the formula that 

^2[a](t) = lf f [a(ti),a(t 2 )]dt 2 dti, 
* Jo Jo 

coincident with the second term in (10.4); and clear enough that 

n 3 [a}(t) = -J J J ([[a(ti),a(t 2 )],a(t 3 )] - [[a(t 2 ),a(t 3 )],a(ti)]) ^3^2^*1; (10.6) 

however the number of terms grows menacingly with nl, it is never evident when we will find 
cancellations, and one quickly concludes that the beauty of (10.5) hides its computational 
complexity. Nor is it entirely obvious, although of course it is true, that (10.6) coincides 
with the sum of the third and fourth terms in (10.4). 

The best policy, in our opinion, is to invoke the alternative Dyson-Chen solution at this 
point. This attacks three problems: systematic writing of the Magnus series simplifies; the 
zero-weight recursion is solved; and the comparison between different expressions for the 
same terms is made easier. 
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11 Enter the Dyson— Chen series 

The first order initial value problem (10.1), respectively the corresponding recursion 

F(t) = l + R(aF)(t), 

where R is the Riemann integral operator, possess a natural solution in terms of iteration, 
see (8.15). The resulting infinite series is called here Dyson-Chen integral. In the physics 
literature those series are often referred to as time-ordered exponentials or path-ordered 
integrals; their importance can hardly be overstated. To reflect such nomenclature in the 
notation, write 

Te f*a( tl )d tl =Te R(a)(t) ._ i + V] J R(a J R(a J R(a---i2(a))... ))(*)• 

n>0 v 

n times 

The operator T implies the strict iteration of the integral corresponding to the 'time ordering'. 
A short presentation of Chen's work on this kind of integrals can be found in [80] ; the findings 
of Magnus and Chen played a decisive role, especially for Rota and his followers. Directly 
from the group property of the flow, we have for the Dyson-Chen integral the factorization 

giving rise to many identities of integrals and concatenation products of series, which we need 
not go into. This factorization might be compared with the quite different decomposition 
induced by the CBHD recursion (8.11). The major result of the theory is the following 
theorem. 

Theorem 11.1. The logarithm of a Chen series is a Lie series. 

The direct proof of this statement uses Hopf algebra, to wit, the shuffle product algebra 
of our Section 5. It is just a matter of verifying Ree's condition (5.2) inductively. In our 
present context, the Dyson-Chen expansional is the solution to Atkinson's recursion (9.4), 
and the theorem scarcely needs justification. 

Simply by taking the logarithm in 

exp(Q[a](t)) =Te fl(a)W , 

we obtain 

^N=^^— E (Jto)"W...(Jto) M . (H.l) 

k=i h,...,i k eN* 

iiH hifc=n 

This was of course known to the practitioners — see [79, 81] and references there. It is 
derived in [3] by use of the Faa di Bruno Hopf algebra. Inverting these relations, one finds 
the (i?a)t n+1 l's in terms of the Q m [a]'s 

n 

(A«) W = E]ti £ n h [a]...n lk [a]- (n.2) 

fc=i ' h,...,l k ew 

iiH \-h=n 

The first examples are: 

2\(Ra) [2] = nj[a] + 2Q 2 [a], 
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3!(i?a) [3] = fif [a] + 3(ft?[a]ft 2 [a] + Q 2 [a]^[a]) + 6fi 3 [a]. 

that might be compared with (8.21); of course Ra = fli[a] = Ci in the occasion. Now, both 
sets of equations (11.1) and (11.2) simply describe how to link Magnus' expansion to the 
Dyson-Chen expansional. They purely follow from the Rota-Baxter relation as well as the 
CBHD formula. Therefore they are valid for any weight-zero Rota-Baxter operator R. 
By inverting the Rota-Baxter map, we solve moreover the zero-weight CBHD recursion: 



'- (_-nfc+i 

X» = E -^T— E «Ra) [h - 1] (Ra) lh] ■ ■ ■ (Ra) M + 



k 

k=l h,l 2 ,...,l k &N* 
h+h + -+lk=n 



+ (Ra)^a(Raf^ ■ ■ ■ (Raf k] + ■■■ + {Ra)M{Ra)M ■ ■ ■ a(Ra) [l "- 1] ) . 

Next we are set to give an alternative formula to (10.5), keeping left-to-right bracketing. 
This is better explained by way of example. Bring in Heaviside's step function, 



ei, 2 (ti,t 2 ) := ®(h-t 



2, 



1, if h-t 2 > 0, 
0, otherwise; 



with its help, iterated Riemann integrals can be rewritten 

R(aR(b))(t)= / a(ti) / b{t 2 )dt 2 dt l = I I Q{t 1 -t 2 )a{tx)b{t i )dt 2 dtx. (11.3) 
Jo Jo Jo Jo 

More generally, 

Qi,j(ti,t 2 , ...,t n ) := Q(U - tj), for 1 < i, j < n, 

and we can write 

oo r t 



r - - r r 

Jo n=2 Jo Jo 



a(t\) ■ ■ ■ a(t n ) dt n ■ ■ ■ dt\. 



For instance, for the third term of the Magnus series, applying (11.1), 

fi sH = / / / (©lAa- |©i,2- \e2,z + \)a{ti)a{t 2 )a{h)dhdt 2 dt x . 
Jo Jo Jo 

Now, we know — if only from theorem (11.1) — this is a Lie element, so we can apply at once 
the Dynkin operator to rewrite it with nested commutators: 

n 3 [a]=/ / / (0i,2© 2 ,3-|©i,2-|©2,3 + |)[[a(ti),a(t2)],a(t3)]^3^2^i. 
Jo Jo Jo 

We see now that the last term actually does not contribute to the integral. With very little 
work, just using 0i j2 + Q 2>1 = 2 3 + 3j2 = 1, one recovers (10.6). An explicit formula for 
all terms along these lines, fully equivalent to, but simpler to work with, than Strichartz's, 
is easily obtained [45]; we do not bother to write it. We must avow, however, that we do 
not see a way to write terms like the third one in the integral above as a combination of 
iterations and products of the R operators; thus we must conclude that formulae like (10.5) 
and (10.6) are only valid for the Riemann integral. 
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For general zero- weight Rota-Baxter operators we may fall back on (11.1). Magnus him- 
self did not use any property of the map R beyond integration-by-parts, and only presented 
the expansion in a form equivalent to (9.3). Of course, even using purely the weight-zero 
Rota-Baxter relation, there are many equivalent ways of writing the same. For instance, 
simply by Proposition 7.1, one finds that the term at third order in Magnus' expansion (10.4) 
is rewritten 

l -R{[R{[R{a), a}) ,«])(*)- ^ [R([R(a), a]) , R(a)] (t). 

It is worthwhile to mention that Iserles and Norsett use binary rooted trees to achieve a 
better understanding of Magnus' expansion [76,77]. 

We have long taken the algebraic tack. But what about convergence of the Magnus series? 
Note that Dyson-Chen series converge absolutely for all t if a is bounded, and this is why 
they are preferred in quantum field theory; however this good property is not transmitted in 
general to Magnus series via (11.1), as there is an infinite resummation involved. Excellent 
bounds at small t have been found recently [82] for matrix systems. Strichartz linearizes 
arbitrary initial-value problems, for which we cannot expect convergence in general in the 
smooth category; but he does not fail to observe that Magnus' expansion has especially 
good properties for Lie-Scheffers systems [43, Section 3]. This is because the closing of the 
involved vector fields to a finite- dimensional Lie algebra sharply improves the estimates. 
Furthermore, for those systems Magnus' exponential can be interpreted as the exponential 
map of Lie theory. 

12 Towards solving the ^-weight recursions 

Let us now come back to Proposition 8.7 and take the first steps in going from the Dyson- 
Chen series to the ^-weight CBHD recursion. This looks somewhat hard; but recall that Lam 
found, in the context of the Riemann integral, another way to relate the terms in the Dyson- 
Chen series to those in the Magnus expansion — consult [73,83]. In fact, Lam's findings are 
true in a much more general sense, i.e., for general weight Rota-Baxter algebras, as we will 
indicate here. The attentive reader will remember the weight-zero pre-Lie product (7.12), 
that allows for the following way of writing the weight zero CBHD recursion x°( a )> see (9.3): 

x °(a) = a + ^a- R a+ (~(a - R (a - R a)) + — ((a - R a) - R a)^ 

+ -ft ( a 'R (( a 'R a ) 'R a ) + ( a R ( a 'R a )) 'R a + ( a 'R a ) 'R ( a 'R a )) 

+ g#(a-B {a- R (a- R a))^j + ■•■ . (12.1) 

This contains in germ the main idea. Remember (8.18) in terms of the (double and) pre-Lie 
Rota-Baxter product. Lam made an exponential Ansatz 

n>0 m>0 

and derived the following formulae for the K^s in terms of C^(a), . . . , C^(a): 

K^a) = C*la), K 2 (a)= 1 -C 2 R (a) J K 3 (a) = lcf(o) + ±[C?{a),C?(a)], 
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K A {a) = l C«(a) + -[CK(a),C«(a)},... 

The weight-^ Rota-Baxter relation enters at the level of identity (8.20), hence implying the 
particular form of the KjS. This naturally demands a comparison with the CBHD recursion, 
respectively the generalized Spitzer identity. 

Theorem 12.1. Let (A,R) be an associative Rota-Baxter algebra of weight 9. Then for 
Ki = Ki(C?(a), . . . , Cf (a); 9) we have 

^2 = -R( X e {9- 1 \og(l A - Oat))). 

i>0 

Hence, finding a formula for the K^s gives a solution, in the sense of a closed expression, 
to the CBHD recursion x 6 > which follows from the Rota-Baxter relation. A full proof of 
this statement lies beyond the scope of this work and will be provided elsewhere [72] . In the 
context of Hopf and Rota-Baxter theory, it is the generalization of the shuffle relation to 
the quasi-shuffle (or mixed-shuffle) identity [84] underlying the algebraic structure encoded 
in the Rota-Baxter relation of nonzero weight, which generalizes the integration-by-parts 
rule (7.3) corresponding to the shuffle relation. 

Of course, when 9 = Lam's K^s are just the Magnus fij's. These are expressed as 
sums of commutators of right-to-left bracketed integrals, when R is the Riemann operator. 
This turns out to be the most efficient method for the expansion, as well. For instance, the 
expression of K 5 contains just six terms, whereas f2 5 is written usually with 22 terms [83]. 

We close this section with a simple but striking observation flowing ^from the last the- 
orem. Defining u(at) := ^ _1 log(l^ — Oat), we recover — x°{u{at)) from x°(at), that is, 
from (12.1), simply by using the weight-^ pre-Lie product 7.12). A full proof of this statement 
will be given elsewhere. But we show this here up to third order. Using 9~ x log(lA — Oat) = 
-En>o^(«^wefindfor 

-X e (u(at))=at-J2Xn« a )) tn+1 

n>0 

the following 

xfiMa)) = ±6a 2 -±[R(a),a], 

X( 2) («(a)) = -\9([R(a 2 ),a] + [R(a),a 2 ]) +±[a, R([a, R(a)})] 

+1 ( [[a, R(a)} , R(a)] - 9 [a, [a, R(a)}] ) . 

Let us go back to (12.1) and use the pre-Lie product a -^b := [a, R(b)\ + 9ba of (7.12). We 
obtain at second order 

111 
-a- R a = -[a, R(a)] + -9a 2 . 

At third order we calculate: 

t {a -r {a - R a)) + —((a -r a) - R a) 
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= +j[a, R(a - R a)} + -6 (a - R a) a + —[(a - R a), R(a)} + -^9a(a - R a) 
= l - [a, R([a, R(a)])] + ±6[a, R(a 2 )] + ±6[a, R(a)]a + -fa 3 

+ ^ [[a, R(a)}, R(a)] + ^9[a 2 , R(a)} + ^a[a, R(a)} + ^9 2 a 3 

= \o 2 a 3 + 1 [[a, i?(a)], R(a)} + ~ [a, i?([a, R(a)})} + ~0([a, i?(a 2 )] + [a 2 , i2(a)]) 

111 

+ -6*[a, R(a)}a 9[a 2 , R(a)] H 8a[a, R(a)} 

A 6 12 

= \o 2 a 3 + ^9([a, R(a 2 )] + [a 2 , R(a)]) + \ [a, R([a, R(a)})} 

+ ±[[a,R(a)lR(a)]-±6[a, [a,R(a)}]. 

Earlier in Section 10 we have seen how the Magnus expansion naturally follows from the 
CBHD recursion in the limit 6 { 0. In turn we see here the advantage of reformulating 
Magnus' expansion in terms of the Rota-Baxter pre-Lie product of weight 9 yielding the 
CBHD recursion. 



13 Conclusion and outlook 

Our purpose in this paper was twofold. Starting from the innocent-looking dynamical sys- 
tem (1.1) — of classical Lie-Scheffers type when G is an ordinary Lie group — we sought 
to reformulate it in Hopf algebraic terms, thus being led to generalized derivation and in- 
tegration (Rota-Baxter) operators. Whereby we show that two of the three main ordinary 
strategies to attack non-autonomous linear differential equations (linked respectively to the 
names of Magnus and Dyson-Chen) still make sense in the broader context. In particular, 
the noncommutative version of the Bohnenblust-Spitzer identity has been found, and we 
blaze a trail to solve the nonlinear recursion introduced earlier by one of us in relation with 
the noncommutative Spitzer formula. 

There is no doubt that the product integral method to attack (1.1), often linked with the 
name of Fer [85] , is also susceptible to our kind of algebraic reinterpretation and generaliza- 
tion. However, with a heavy heart, we leave this for a later occasion: the present paper is 
already long enough. 

Needless to say, the programmatic purpose of this work was to propagandize the Hopf 
algebra approach to differential equations. The lure of presenting classical subjects under a 
new light explains why we spent much space on a smooth transition from the standard to a 
Hopf-flavoured view of dynamical systems; and indeed this article became a powerful spur 
to revisit the traditional proof of the Lie-Scheffers theorem, and plug its gaps [17]. On the 
other hand, many of our findings and procedures will surely not raise an eyebrow of people 
working in sophisticated methods for control theory — on which we confess no expertise. 
There is, at any rate, plenty left to do. Avenues open for possible research include: 

• The Carinena-Ramos' approach to Lie-Scheffers systems, based on connections, should 
be recast in the noncommutative mould, in the light of [20] and [30]. 

• To relate and compare the action algebroid approach to group & Lie algebra actions 
with the Hopf algebra approach. 
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Investigation of the product integral method. 

Further exploration of the theory of Rota-Baxter operators as natural inverses to 
skewderi vat ions; that is, developing Rota's proposal of an algebraic theory of integra- 
tion. 

Definitive clarification of the noncommutative Spitzer formula and the noncommutative 
Bohnenblust-Spitzer identity in the light of Lam's expansion. 

The bridge to control theory and chronological products, via Loday's dendriform alge- 
bras in particular, should be enlarged and strengthened. In this respect, Lie-Butcher 
theory [86, 87] shows great promise. 
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A Precis on group actions 



Definition 4. A (left) action of a Lie group G on a manifold M is a homomorphism $ of G 
into Diff M. For x G M, and jeGwe denote 

$ g := $(#) and x) := <3> 9 x. 

A right action is just an antihomomorphism of G into Diff M. The orbits of <3> are the subsets 
of M of the form $(G, x) for a fixed x G M; they are homogeneous manifolds, on which the 
action is transitive. We will call $ x the map from G to M defined by g i— > $(g,x). Recall 
that a flow is an action of R on M. When $ with the indicated properties is given, we say 
M is a G-manifold. A Lie group action is proper if given any pair K, L of compacts subsets 
of M, the set {g G G : gA' n L ^ } is compact. The stabilizer or isotropy subgroups are 
then compact. Proper actions, in particular compact group actions of general Lie groups, 
have good properties: for instance the orbits of a proper action are closed submanifolds 
of M [88]. An action is faithful (or effective, or essential) when the map g — > $ s is injective; 
if the kernel of this map is discrete, we say the action is almost faithful. 

A good reference for Lie group actions is [2, Chapter 4]. As for the examples, any Lie 
group G acts on itself by left and right translations L g , R g : G — > G respectively given for 
each g G G by 

g >-»■ gg', g' »-> g'g- 

The inverse diffeomorphisms are L" 1 = L g -i and R" 1 = R g -i. This action is free and 
transitive. Also G acts on itself by conjugation: 

g >-> gg'g' 1 =■ &d(g)g'. 

This action is neither free nor transitive; it is almost faithful iff the centre of G is discrete. 

Definition 5. Suppose G acts both on iV by Q N and on M by $ M . A smooth map / : N —> 
M between these manifolds is equivariant (with respect to the actions) if / o $^ = o / 
for each g G G. The maps $ x : G — > M, where $ is a left (right) action are equivariant for 
all x, with respect to the left (right) action of G on itself and $: 

<S> x o L g = <$> g o<$> x or o R g = $ g o $ x , (A.l) 

as the case may be. 

If G acts on M, then G also acts on TM by 

(g, v x ) ^ ($ 5 x, T x <f> g v x ) =: ® T (g, v x ), for v x G T X M. 
When $ is described in local coordinates, say by 

n dh 

$ i (g,x) = h i (g,x 1 ,...,x n ), then T x $ g v x = ^ ~q~j(9i x 1 ,. . . , x n )v x . 

i=i 

Clearly the map v x i— > $ T (g, u x ) from T X M into T$(g jX )M is linear and the canonical projection 
tm '■ TM — > M is equivariant with respect to these actions: tm{^ t {9,v x )) = & (g,TM(v x )y 
We then say that $ is equilinear [28] . For vector fields, then, there is the action: 

(g 7 X)»T$ g oXo$ g -i. (A.2) 
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Corresponding to group translations we have then equilinear left and right actions of G 
on TG; as well as actions on X(G). In view of (A. 2), a vector field X L on G is left invariant 
if for all g G G, X L o L g = TL g o X L ; this means that X L is T 9 -related to itself for all 
g G G. Therefore the left invariant vector fields constitute a Lie subalgebra X L (G) =: Ql 
of X(G). Replacing L g by R g we obtain right invariant vector fields X R G X R (G) and a 
Lie subalgebra X R (G) =: Qr. In particular, X L ,X R are determined by their values in the 
neutral element: 

X\g) = TiL g X L (L~ 1 g) = 2\L fl X L (l G ); similarly A^) = T^A^le); (A.3) 

for typographical simplicity we write Ti instead of T\ G . The dimension of X L (G) or of £ R (G) 
is thus that of the group. We denote by X^, X R the left invariant, respectively right invariant, 
vector field associated to £ G T\G. The (complete) flow of Xj? is (t, g) t— > gexp(tX^) and 
the flow of is (t,g) i-> exp(tX^)g. 

We remark that is the commutant of Qr in 3£(G), and vice versa. For instance, thinking 
of the affine group of orientation-preserving transformations of the line as a neighbourhood 
of (1,0) with the multiplication rule: 

(x\x 2 ) ■ (y\y 2 ) = (AVy + x 2 ), 

then a basis for left (respectively right) invariant vector fields is 

(Xf, X 2 L ) := (x 1 d 1 ,x 1 d 2 ); respectively (X R , X R ) := (x 1 d 1 + x 2 d 2l d 2 ). 

With our Lie bracket, by the way: [X R , X R ] = X R . It is an easy exercise to check that if 
ai(x l , x 2 )d\ + a 2 (x 1 , x 2 )c?2 commutes with X±, X%, then it is a linear combination of X R , X R 
with scalar coefficients. 

Consider the tangent map Ti : TG — > TG lifting the inversion diffeomorphism % : g > g^ 1 
on the base; it carries left invariant vector fields into right invariant ones. The vector fields 
Ti o X^ and — X R o % along % coincide, that is, X^ is z-projectable on —X R . This simply 

because exp(tA^)) -1 = exp(-tX^)g. Therefore [X^,X%] projects into [X R ,X^]. 

Now, TG is itself a group, with product T\i lifted from the product fi : G x G — >• G. The 
short exact sequence (where T\G is the additive group of this tangent linear space) 

-> TiG -> TG -» G -» 1 

splits, which means TG ~ TiG X G, with TiG embedded in TG as a normal subgroup. In 
particular TG is a trivial vector bundle. We have in TG: 

gv g > = TL g v g/ ; v g >g = TR g v g >. (A.4) 

Clearly, the action of G on TiG is just Adf . Henceforth we write Ad for this adjoint action 
of G on TiG. A Lie bracket can now be defined directly on TiG by [£,r/] := ad(£)?7 := 
Ti Ad(£)r/. One could also transfer to TiG the Lie algebra structure from X L (G) or X R (G), 
say [£, r/] := [X R , X R ](1). That these and other natural definitions amount to the same is 
standard fare [89, Appendix III]. The space TiG with any of these equivalent structures is 
what people call the tangent (Lie) algebra q of G. 

Definition 6. A left (right) Lie algebra (infinitesimal) action A on M is a Lie algebra 
homomorphism (antihomomorphism) g 3 £ i— > G X(M); we say M is a g-manifold. The 
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action is said transitive at x when the A^(x) span T X M. It is furthermore primitive when the 
stabilizer q x is a maximal subalgebra; these concepts are analogous to the case of Lie group 
actions. When the action is transitive at all points of M, we say infinitesimally transitive. 
Given A : g — > £(M), if A is made up of complete vector fields (in particular when M is 
compact, guaranteeing completeness of all vector fields) and G is the simply connected Lie 
group with Lie algebra q, then there is a unique $ : G — > Diff M such that Ti<& = A. This 
lifting to a group action always exists locally. We remark as well that our choice of sign for 
the bracket of vector fields insures that the derivative Ti$ of a left action is a left action. 

For the infinitesimal description of actions, the following notion is essential. 



Definition 7. Let $ denotes an action of G on M. For £ £ g, the map (t, x) i— > $(exp x) is 

- (|> 



a flow on M. The fundamental vector field or infinitesimal generator of $ corresponding 



to £ is the vector field 



$(exp^,x) = T 1 $ a; (0. (A.5) 

i=0 



The superscript $ is omitted in the notation when the action is clear in the context. The 
image of q under T\$> x is the tangent bundle T{G ■ x) of the $-orbit. The corresponding 
differential operator is given by 



<t/(x) == | 



/($(expt£,x)). 

t=o 



The anchor map £ h- > ^f.j from the tangent algebra g to X(M) constitutes a Lie-Rinehart 
algebra; the corresponding Lie algebroid will be transitive when the action of g on M is 
infinitesimally transitive. 

For example, when <3> is L g : G — > G, we know that the corresponding flow is (t, g') i— > 
exp i£. Therefore 

ZGW) = TiR*(i = x*W), (A.6) 

the ng/rf invariant vector field associated to £. By the same token £,§(g) = X^(g). 
If M is a G-manifold, the flow of is given by $ exp ^- Indeed, 



— $(expt£,x) = — 
at as 



$(exp(s + t)£,x) 



s=0 

$(exp sf , x) o $(exp if, x) = f M o $ cxp ^(x). 



As a consequence f^ is complete. The reader will have little difficulty in verifying the 
following 

Proposition A.l. Let N,M be G-manifolds with respective actions <& N ,<& M , and f : iV — > 
M a smooth map equivariant with respect to these actions; then ~/ £m> that is Tfo£ N = 
£m ° f- More precisely, £n ~/ Cm iff the flows verify 

fo$ N r .„ = $ Af r .„ O f . 

J ^cxp(^) ^cxp(i£) J 
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Proposition A. 2. For every £,rj £ Q we have 

[€m,Vm] = [£,v]m- 
In other words: £ i— » £m is a left Lie algebra action. 
Proof. A simple calculation gives 

(Ad g £) M = T<S> g -^ M . 

We obtain the result immediately by differentiation. Our unconventional choice of sign for 
the Lie bracket of vector fields avoids the obnoxious minus signs of the usual treatments. □ 

The action <3> of G on M lifts naturally to representations of G on the various linear 
spaces associated with M — for instance to representations on spaces of sections of vector 
bundles [90] or on morphisms of vector bundles. We will limit ourselves to some simple 
cases, needed in the main text. For / £ jF(Af), we consider (g ■ f)(x) := f(^(g~ 1 , x)); then 
for T £ T\M) and for D £ D(M): 

(g ■ T, f) := (T, g^ 1 ■ f), respectively (g ■ D)f := g ■ D(g~ l ■ /). 

Invariant functions, distributions and differential operators are defined in the obvious way. 

B Differential equations on homogeneous spaces 

The problem of solving non-autonomous differential equations on homogeneous spaces of 
Lie groups is intimately linked to Lie-Scheffers theory: given an arbitrary Lie group G and 
an action of it on a manifold M, for most purposes one can restrict oneself to the orbits 
of the action, that is, the points of M/G; these are (immersed) submanifolds of M of the 
form G/G x , with G x := {g E G : <&(g,x) = x} the stabilizer of a point x of the orbit, a 
closed Lie subgroup of G. From this perspective, Lie-Scheffers systems are precisely those 
that can be rewritten in the form 

x(t) = \ m (x(t)), (B.l) 

where A : R — > q is a curve on the Lie algebra q and A denotes an infinitesimal action. If 
A = T$ for some action $ of G on M and g(t) solves the initial value problem (2.9): 

g(t) = &(t,gm <7(0) = lc, (B.2) 

then the solution of (B.l) with initial condition x(0) = Xq is given by the integrated action: 
x(t) = $>(g(t),xo)- At this point we again advise the reader to consult [17]. 

In practice we consider transitive actions on M = G/G x . Suppose that xm is a particular 
solution of (B.l) satisfying x(0) = xq. Let g\ £ Map(R t , G) such that x^(t) = $>(gi(t),xo)- 
Such curve is not unique in general; but, if g% is another one, then ^(0 = 9i{t)h(t) with h in 
Map(Rt, G XQ ). It is convenient to choose h so that g2 is the fundamental solution of (B.l): 

g 2 {t) = T.R^it), 
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upon using (A. 6) in the last equality. Then h is the fundamental solution of the Lie-Scheffers 
system associated to the curve B : M. — >• g XQ , given by [91]: 

B{t) =T l L gi{tr ,{T l R gi{t) m ~ 9iit))- 

Therefore the knowledge of a particular solution of (B.l) that satisfies X(i)(0) = Xq reduces 
the problem of finding the fundamental solution for G to finding the fundamental solution 
for the subgroup G XQ . Naturally if more particular solutions are known, whose values at are 
xi, . . . , x r , then we can reduce the problem to solving a Lie-Scheffers system in the subgroup 
G XQ fl • • • H G Xr . When this group is discrete, one can explicitly compute the fundamental 
solution for G, from which the general solution of the original Lie-Scheffers system can be 
derived. This is known as the Lie reduction method. 

A variant of the Lie reduction method was studied in the language of gauge theory in [16]. 
Without actually invoking connections, we illustrate the approach in this last reference 
with the Riccati equation (2.5). The latter seeks the integral curves of the vector field 
along tt 2 : R t x M -> M: 



For vector fields E + = 
relations 

[H,E. 



Y 



d/dx, H 



E 4 



d 

(a {t) + a±(t)x + a 2 (t)x 2 )— . 



xd/dx and E. 
[E 



x 2 d/dx we observe the commutation 



exactly those of the matrices E' + 



E-] 

1 




2H: 



[H, E_] 



-E. 



H' :-- 



E±,H realize the (perfect) Lie algebra s[(2; 
flows of R are respectively 



E' 







(B.3) 
Therefore 



of the group SL(2; '. 



The corresponding 



Xq + t; 



X 



x e 



xq 



x Q t 



the last one blows up for xq > in finite time, indicating that E_ is not complete. This 
can be corrected by adding to K. the point at infinity. More precisely, we have the well- 
known action of the projective group SL(2; M.)/Z 2 on the projective line RU oo — to wit, 
the projectivization of the fundamental action of SL(2; M.) on M 2 . Just as well, in the spirit 
of this article, we can decide to regard the action as a local one, defined on the open set 
of SL(2; 1) xl given by the pairs such that cx + d ^ 0. 

Now, consider the group Map(R t , SL{2; R)) of curves acting on the set of Riccati equa- 
tions (that is, the group of automorphisms of the trivial principal bundle SL(2; R)xK ( — > R t ) 
corresponding to the indicated action, expressed by: 



$(A(t),x(t)) = $ 



«(<) P(t)\ r(f) 

lit) 6(t) h x[t) 



a(t)x(t) + /3(t) 
7 (t)x(t) + 6{t) ' 



together with the other obvious cases. When x(t) is a solution of the Riccati equation (2.5), 
then x'(t) := $>(A(t),x(t)) is also a solution of a Riccati equation with coefficients 

7<5 — #7 
5a — a5 + /?7 — 7/3 
a/3 — (3a 



a' 2 (t)\ 


( 5 2 


—5^ 


a[(t) = 


-2(35 


a5 + (3 j 


a' {t)) 


K p 2 


—a/3 
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The second term on the right hand side is a 1-cocycle for the linear action on the co- 
efficients of the Riccati equation given by the first term. If a particular solution xru(t) 

of (2.5) is known, the element Ai(t) = ( ?) £ Map(Rt, SL(2; R)), transforms the 

original Riccati equation into the linear equation dx' /dt = (2x^(t)a (t) + a\{t))x' + a , 
thereby reducing the group SX(2;K) to the subgroup A(1;R). When a second particular 
solution X(2){t) of (2.5) is given, then x' = X(i)X(2)/(%(i) — £(2)) satisfies the linear equa- 
tion, therefore we obtain the corresponding homogeneous linear equation using the matrix 

A 2 = x (i) x (2)(, x (i) x (2)) j Concretely, the change of variables 

x" = $(A 2 ,af) = *(A 2 A 1 ,x)- (1) () 



(X( 2 ) - X(l))( x ~ 

leads to the homogeneous linear equation dx" /dt = (2x~^(t)a (t) + ai(t))x". Finally, if a; (3) 
is a third particular solution of (2.5), then z = x^(x^) —X(3))/{x(2) — x {i))( x {i) — x (3)) solves 

z- 1 ' 2 



this linear equation, thus if A 3 = ( 1/2 ) > ^ ne transformation 



x'" = $(A 3 A 2 A 1 ,x) 



[X - X ( 2))(X(i) - X( 3) ) 



(Z - X(i))(X( 2 ) - Z( 3 )) 

gives the reduced equation dx 1 " /dt = 0, which is the superposition principle (2.3) for the 
Riccati equation. 

We are not likely to find an exact solution for (B.2) in most cases. This is one reason why 
we concentrate on approximate solutions in this paper. To attack (2.8), it is generally a good 
strategy to move on to an equivalent system on the tangent algebra of G — a coordinate space 
for G which enjoys the advantage of being a linear space. To effect properly the method of 
working on the tangent algebra, one needs to ponder equivariant maps between homogeneous 
spaces. We go to this in the next appendix. 

C More on the same 

Consider again the canonical action of G on its tangent algebra g, and let / : g — > G be 
a local coordinate map. The exponential map is an example, but of course there are slight 
variants of it (see below); or we could employ, if available, the Cayley map [92]. A local action 
B* of G on q is constructed by = f~ x o L g o /. This is a (somewhat skew) generalized 
version of the CBHD map, since, if / is the exponential map, then for rj £ q we obtain: 

5 cxp (^,r/) = hg(gexprj) = log g + 77 + CBHD (log g, rj), 

with the notation of Section 8. Similarly for right actions. 

By definition the map / is equivariant with respect to B* and left translations. Since the 
maps $ x are also equivariant, their composition <& x o / : g — > G — > M is equivariant, and we 
have the following commutative diagram relating the flows on M, G and g: 
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with the notation of Section 4 for exp(££). By Proposition A.l, this commutative diagram 
can be extended to: 

Tf T<i> 

Tq^qxq — -^TG — -TM 




in particular 



(c.i; 



The overarching question is now: what is the concrete description of £ g ? This we answer 
next, and we obtain a congenial reply. Write g = f(u) with u £ g, to distinguish the role 
of the points of g as coordinates for G. The map T u f : Tg — > TG can be factorized into 
a map from T u g fh q to Q, say A^, and the translation TiRfi u )- Now, in view of (A. 6), 
T f ° £ = ° / gives 

Tii2 /(u) oAfo £ 8 ( u ) = Tii?/^) f ; 

therefore 

£ g (u) = (4)-^, (C.2) 

where A{ = Tf( u )Rf-i( u ) °T u f is the Darboux derivative of /, a map Tg -^gxg yielding the 
pullback via / of the right Maurer-Cartan form on G (a g- valued 1-form on g) . Then one 
recovers the 'static' version of (2.7) from a slightly different viewpoint. Note the double role 
of g in the construction: on the one hand, its elements are parameters of the infinitesimal 
generators on M; on the other hand they serve as coordinates of the linear space on which 
we want to solve a differential equation equivalent to the one originally given on M. The 
general Darboux derivative for group-valued maps on manifolds is a key ingredient in the 
study of connections via transitive Lie algebroids [5]. 

In summary, a differential equation on a homogenous G-manifold M — described by 
infinitesimal generators of the Lie group action along the projection l t x M -> M — has 
been transformed to a 'pulled-back' equation on the tangent algebra of the group: 



u 



(C.3) 



by means of the commutative diagram 



T<S> x oTf 

Tq -^TM 



k x g 



x M, 



where £ g is the vector field along M. t x g — > g associated to the curve t 1— > explicitly 
given by formula (C.2). The equation evolving on the Lie algebra is susceptible of attack by 
geometrical integration techniques, a point made in [93]. 

To exemplify, let us look at Riccati's equation (2.5) again. Consider 



L := a (.)E' + + ai (.)H' + a 2 {.)Ei E Map(R t ,sI(2; R)). 
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We know that if we are able somehow to solve the equation 

-£=L(t)g{t), With g(to) = 1 5 L(2;R), (C.4) 

then (2.5) is entirely solved by the 'Green operator' 

x(t) = $(g{t),x ). 

where $ is the integrated action considered in the previous section. To search for that 
solution, let us bring in a variant of the exponential map [94]. Using canonical coordinates 
of the second kind for the element g(t) G G, write: 

g (t) = f(u(t)) := exp(M (t)^)exp(M 1 (t)fT / ) exp(u 2 (t)E'_) 

=: exp(u°(t)L' ) exp(n 1 (t)L , 1 ) exp(u 2 (t)L' 2 ). (C.5) 

Therefore / denotes the defined locally bijective map from st(2;R) onto SL(2;M), with 
u = (w , w 1 , w 2 ). (Incidentally, this means that we seek the general solution of the Riccati 
equation under the form 



then, taking x = oo, 0, 1, three particular solutions are obtained, and the reader will see at 
once that the superposition formula (2.3) follows from here.) 

Replacing g{t) in (C.4) by (C.5), upon using the commutation relations we obtain 

^P-g-\t) = u°E' + tfe^+H'e-*** + u 2 e u ° E ' + e ulH ' E> e^"' e~ u ° E ' + 
dt v ' + 

= u°E' + + ii 1 exp(w° ad E' + )H' + u 2 exp(w° ad E' + ) exp(w 1 ad H')E'_ 

= u°E' + + u\H' - u°E' + ) + u 2 e" al exp(u° &dE' + )E'_ 

= u°E' + + u\H' - u°E' + ) + u 2 e~ u \E'_ - 2u°H' + (u°) 2 E' + ) 

= („o _ n o^i + ( u oy e -ui^ E , + + _ 2u Q e- ul u 2 )H' + e~ u \ 2 E'_ 

= a (t)E' + + ai (t)H' + a 2 {t)E'_. 

This leads to the following differential equations for the w-variables: 

u° = a<,(t) + ai(t)u° + a 2 (t)(u ) 2 

u 1 = ai (t) + 2a 2 (t)u° (C.6) 
u 2 = a 2 (t)e ul , 

to be solved under the initial conditions u°(to) = M^to) = u 2 {to) = 0. With the chosen 
map /, the first equation of this system is the same Riccati equation we started with. This 
we contrived to make the point again that one particular solution needs to be known, for 
the general solution to be obtainable by quadratures. The explicit form of the Darboux 
derivative in our example is 

2 

{u\u\u 2 -v\v\v 2 ) » £ Ad rfii 1 ««P(^)"* L *- 

k=0 
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The inversion of this map was just performed, with result the field corresponding to the 
system (C.6), to wit, 

(a (t) + ai (t)u° + a 2 (t)(u°) 2 )d + (ai(t) + 2a 2 (t)u°)d l + a 2 (t)e ul <9 2 , 

which is our (C.3). Once the latter equation is solved, the rest is obvious: as repeatedly 
said, one just uses the map $ o /, to go back to M. 

The perceptive reader would ask at this point: what about transferring the convolution 
algebra in Section 4 to the tangent algebra, too? We know nowadays that for conjugation 
invariant distributions this can be done [95]. 

Further work on connections a la Lie-Rinehart in the respect of Lie-Scheffers systems is 
in progress [96]. 



D Faa di Bruno Hopf algebra and the Lie— Engel theorem 

Due to its fundamental nature, the Hopf algebra we conjure next is ubiquitous. Let Diffp (M) 
be the group of orientation-preserving formal diffeomorphisms of M (similarly for C) leaving 
fixed. We think of them as exponential power series: 



/(*)=£ 4 *" with /i>°- (D.l) 

TV. 



n=l 

On Diffjj (R) we consider the coordinate functions 

a„(/):=/„ = / (n) (0), n>l. 

Now, 



k=l V Z=1 y 



where h is the composition / o g of two such diffeomorphisms. Therefore, from Cauchy's 
product formula, the nth coefficient h n = a n {h) is 

h -X^!± n] 9h "' 9lk 

n L> h\ ■■■i k \ ' 

k=l li>l,h+-+l k =n 

To rewrite h n in a compact form, it is convenient to introduce the notation 

n \ nl 



K \-kJ ' A 1 !A 2 !...A n !(l!) A i(2!) A 2...(n!)^' 

Then, taking in consideration that the sum h + ■ ■ ■ + h = n can be rewritten as 

Ai + 2X 2 + • • • + n\ n = n, where Ai + • • • + A n = k 

if there are Ai copies of 1, X 2 copies of 2, and so on, among the h] arid that the number of 
contributions from g of this type is precisely the multinomial coefficient 

k \ kl 



Ai ■ • • X n J X\l ■ ■ ■ X n l 

55 



it follows: 



n , s n 

Y,f k E ( A . A .)»i 1 ---^= : Z)/* s »^i'---'^+ 1 -*)' ( D - 2 ) 

fc=l Ahn,|A|=fc ^ ' ' k=l 



We have used notations of the theory of partitions of integers. The B nk are called the 
(partial, exponential) Bell polynomials, often defined via the expansion 

U 3m ~\ ) = 1 + ~\ Yl ukB nAdU • • • , Qn+l-k) 

m>l n>l ' Lfc = i 

which is a particular case of (D.2). Each B n ^ is a homogeneous polynomial of degree k. 

According to (4.5), a coproduct on 72.(Diffg (R)), which we realize as the polynomial 
algebra R[ai, 02, . . . ], is given by Aa n (g, f) = a n (f o g). This entails 

Aa - = XI (^\{ 1 4 2 ...a x n -®a k . (D.3) 

fe=l Ahn,|A|=fc ^ ' ' 

The flip of / and g is done to keep the tradition of writing the linear part on the right of the 
tensor product; this amounts to taking the opposite coalgebra structure. With (D.3) we have 
a bialgebra structure. In a Hopf algebra grouplike elements are invertible: g^ 1 = Sg. Since 
a\ is grouplike, to have an antipode one must either adjoin an inverse a^ 1 , or put a± = 1. 
The latter is equivalent to work with the subgroup Diff ( ]" 1 (R) of Diff^j~(R), of diffeomorphisms 
tangent to the identity at 0, that is, to consider power series such that /o = and f\ = 1. 
The coproduct formula is accordingly simplified to: 

n / \ n 

Aa n = ^2 Y ( j> u ) a 2 2Q 3 3 " " • ® Q fc = ^ -ffn,fc(l, • • • , Qn+l-k) <8> Qfc- 
fc=l Ahn,|A|=fc ^ ' ' k=l 

The resulting graded connected Hopf algebra T = 7?. cop (Diff ( |" 1 (R)), where the superindex 
stands for the opposite coalgebra structure, was baptized Faa di Bruno Hopf algebra by Joni 
and Rota [97]. The degree is then given by \a n \ — n — 1. 

Several comments are in order. Formula (D.2) can be directly expressed in terms of 
partitions of finite sets; consult [3] or [98, Chapter 5]. The happy fact that the algebra 
of representative functions 7?. cop (Diff ( ^ 1 (R)) is graded is related to the linearity of the pro- 
duct / o g in one of the coordinates. This also means that Diff J\(R) is the inverse limit of 
finite-dimensional matrix groups, and that it possesses a (necessarily unique) right invariant 
connection with vanishing torsion and curvature. Also, although 'formal' may sound a bit 
dismissive, one should remember that, in view of E. Borel's theorem, expression (D.l) does 
represent a smooth function; and that (D.2) can be used to show without having recourse 
to complex variables that the composition of analytic functions on appropriate domains is 
analytic [99]. 

Let us turn our attention to the dual of the Faa di Bruno Hopf algebra T. Since we are 
dealing with a graded connected Hopf algebra it is natural to consider the graded dual, that 
we denote simply by J-'\ for which T" = T . (From the discussion in Sections 4 and 5 we 
know there exist bigger duals, for instance T' does not have grouplike elements apart from 
its unit 77.) Let a' n be the linear functional defined by (a' n ,P) = dP/da n (0), where P is a 
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polynomial in M[a 2 , 03, . . . ]. In particular the a' n kill non-trivial products of the a q generators. 
Also, taking in consideration that the counit r](P) = -P(O) of T is the unit in T' 

(Aa' n , P®Q) = «, m(P ® Q)) = (a' n , PQ) = ^j^(O) = (a' n ® 1 + 1 ® a' n , P ® Q), 



Thus the a' n are primitive. Using the definition of the Bell polynomials 

{a' n a' m , a q ) = (a' n ® a^, Aa g ) = 



(m+n-1) if g = m + n _ 1 

otherwise. 



On the other hand, note that 

A(a q a r ) = a q a r <g> 1 + 1 <g> a g a r + a g ® a r + a r ® a g + R, 

where R is either vanishing or a sum of terms of the form b ® c with b or c a monomial 
in a 2 , a 3 , . . . of degree greater than 1. Therefore 

1 if n = q ^ m = r or n = r ^ m = q, 
(a' n a' m , a q a r ) = (a' n <g> a^, A(a q a r )} = {2 if m = n = q = r, 

otherwise. 

Similarly, since all the terms of the coproduct of three or more a q s are the tensor product 
of two monomials where at least one of them is of order greater than 1, it follows that 

( a n a mi a <3i a <?2 a <?3 ' ' ' ) = 0. 

Collecting all this together, 

a n a m ~ I I a n+m-l I Onm){Q>n a rn) ■ 
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In particular, 



r / /!_//_ / / _ / _ + 1)1 , 

fl ni a m - — a n a m a m a n — \Jn U) a n+m-l- 



Therefore, taking 6^ := (n + l)!a^ +1 , we get the simpler looking 

[C &m] = (m - n) b' n+m . (D.4) 

The Cartier-Milnor-Moore theorem implies that T' is the enveloping algebra of the Lie 
algebra spanned by the b' n with commutators (D.4). Obviously T' can be realized by the 
vector fields Z n := x n+l d/dx, for n > 1, on the real line [100]. 

Consider the 'regular representation' of T given by (a > a', b) := (a', ba) on T' . Since 

(bo (al> a'),c) = (a> a' ,cb) = (a' ,cba) = (bao a' ,c) and (lo a' ,b) := (a' ,b), 

we do obtain a left module algebra over JF. Let now a be a primitive element of T] using 
the Sweedler notation: 

(a > If a', c) = (b'a', ca) = (U <g> a', A(ca)) = (6 ; ® a', AcAa) 
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= (b' <g> a', C(i) <g> C( 2 )(a <g> 1 + 1 <g> a)) 
= (a > b' ® a' + b' <g> a > a', C(i) (g> C( 2 )) 
= ((a > b')a' + 6' (a > a'), c), 

so a acts as a derivation. In particular if a > a' = a > 6' = 0, then a > (a'6') = 0, hence the 
kernel of the map a > • is a Lie subalgebra of vector fields, and we conclude that primitive 
elements of T identify finite-dimensional Lie subalgebras of vector fields. Now, the space 
P{T) of primitive elements of T has dimension two. Indeed, P(F) = (Ml © -7~+ 2 ) , where 
T' + := ker?7 is the augmentation ideal of J 7 '. By (D.4) there is a dual basis of T' made of 
products, except for its first two elements. Hence dim P(J r ) = 2. A basis of P(T) is given by 
{«2, 03 — | a |}- This yields the equations y" = and y'y'" — 3(y") 2 /2 = 0, respectively solved 
by dilations and by the action of SL(2; K) we know; translations do not show up because we 
made a\ — 1. 

The previous argument, together with the part of classical one [101] — more recently 
rehearsed in [28, Section XIX] or in [91] — to the effect that infinitesimally transitive actions 
on the line must correspond to Lie algebras of vector fields of dimension at most three, 
shows that Riccati's is the only nonlinear Lie-Scheffers differential equation on the real (or 
complex) line. Whether or not it is simpler to think in Hopf algebraic terms seems largely 
a matter of taste. We do contend that the Beatus Faa di Bruno algebra is too fundamental 
an object to ignore. 
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